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TITLE OF THE INVENTION 

HEPATITIS C VIRUS REPUCONS AND REPUCON ENHANCED CELLS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 The present application claims priority to U.S. Serial No. 60/263,479, 

filed January 23, 2001, hereby incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

The references cited in the present application are not admitted to be 

10 prior art to the claimed invention. 

It is estimated that about 3% of the world's population are infected 
with the Hepatitis C virus (HCV). (Wasley, et al, 2000. Semin. Liver Dis. 20, 1-16.) 
Exposure to HCV results in an overt acute disease in a small percentage of cases, 
while in most instances the virus establishes a chronic infection causing liver 

15 inflammation and slowly progresses into liver failure and cirrhosis. (Iwarson, 1994. 
FEMS Microbiol Rev. 14, 201-204.) In addition, epidemiological surveys indicate an 
important role of HCV in the pathogenesis of hepatocellular carcinoma. (Kew, 1994. 
FEMS Microbiol. Rev. 14, 211-220, Alter, 1995. Blood 85, 1681-1695.) 

The HCV genome consists of a single strand RNA of about 9.5 kb in 

20 length, encoding a precursor polyprotein of about 3000 amino acids. (Choo, et al, 
1989. Science 244, 362-364, Choo, etal, 1989. Science 244, 359-362, Takamizawa, 
etal, 1991. J. Virol 65, 1105-1113.) The HCV polyprotein contains the viral 
proteins in the order: C-El-E2-p7-NS2-NS3-NS4A-NS4B-NS5A-NS5B. 

Individual viral proteins are produced by proteolysis of the HCV 

25 polyprotein. Host cell proteases release the putative structural proteins C, El, E2, and 
p7, and create the N-terminus of NS2 at amino acid 810. (Mizushima, et al, 1994. /. 
Virol 68, 2731-2734, Hijikata, etal, 1993. RNA.S. USA 90, 10773-10777.) 

The non-structural proteins NS3, NS4A, NS4B, NS5A and NS5B 
presumably form the virus replication machinery and are released from the 

30 polyprotein. A zinc-dependent protease associated with NS2 and the N-terminus of 
NS3 is responsible for cleavage between NS2 and NS3. (Grakoui, et ai, 1993. 7. 
Virol 67, 1385-1395, Hijikata, etal, 1993. P.NA.S. USA 90, 10773-10777.) A 
distinct serine protease located in the N-terminal domain of NS3 is responsible for 
proteolytic cleavages at the NS3/NS4A, NS4A/NS4B, NS4B/NS5A and NS5A/NS5B 

35 junctions. (Barthenschlager, et al, 1993. J. Virol 67, 3835-3844, Grakoui, et al. 
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1993. Proc. Natl. Acad. Sci. USA 90, 10583-10587, Tomei, et al, 1993. /. Virol. 67, 
4017-4026.) NS4A provides a cofactor for NS3 activity. (Failla, et al, J. Virol 1994. 
68, 3753-3760, De Francesco, et al, U.S. Patent No. 5,739,002.) NS5A is a highly 
phosphorylated protein concurring interferon resistance. (De Francesco, et al, 2000. 
5 Semin Liver Dis., 20(1), 69-83, Pawlotsky, 1999. J. Viral Hepat. Suppl. 7, 47-48.) 
NS5B provides an RNA polymerase. (De Francesco, et al., International Publication 
Number WO 96/37619, Behrens, et al, 1996. EMBO 75, 12-22, Lohmann, et al., 
1998. Virology 249, 108-118.) 

Lohmann, et al., Science 285 , 110-113, 1999, illustrates the ability of a 

10 biscistronic HCV replicon to replicate in a hepatoma cell line. The biscistonic HCV 
replicon contained a neomycin cistron and an NS2-NS5B or an NS3-NS5B cistron. 
"NS2-NS5B" refen; to a NS2-NS3-NS4A-NS4B-NS5A-NS5B polyprotein. "NS3- 
NS5B" refere to a NS3-NS4A-NS4B-NS5A-NS5B polyprotein. 

Bartenschlager, European Patent Application 1 043 399, published 

15 October 11, 2000 (not admitted to be prior art to the claimed invention), describes a 
cell culture system for autonomous HCV RNA replication and protein expression. 
Replication and protein expression is indicated to occur in sufficiently large amounts 
for quantitative determination. European Patent Application 1 043 399 indicates that 
prior cell lines or primary cell cultures infected with HCV do not provide favorable 

20 circumstances for detecting HCV replication. 

SUMMARY OF THE INVENTION 

The present invention features nucleic acid containing one or more 
adaptive mutations, and HCV replicon enhanced cells. Adaptive mutations are 

25 mutations that enhance HCV replicon activity. HCV replicon enhanced cells are cells 
having an increased ability to maintain an HCV replicon. 

An HCV replicon is an RNA molecule able to autonomously replicate 
in a cultured cell and produce detectable levels of one or more HCV proteins. The 
basic subunit of an HCV replicon encodes for a HCV NS3-NS5B polyprotein along 

30 with a suitable 5' UTR-partial core (PC) region and 3' UTR. The 5' UTR-PC region 
is made up of a 5'UTR region and about 36 nucleotides of the beginning of the core. 
Additional regions may be present including those coding for HCV proteins or 
elements such as the complete core, El, E2, p7 or NS2; and those coding for other 
types of proteins or elements such as a encephalomyocarditis virus (EMCV) internal 

35 ribosome entry site (IRES), a reporter protein or a selection protein. 
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The present application identifies different adaptive mutations that 
enhance HCV replicon activity. Enhancing replicon activity brings about at least one 
of the following: an increase in replicon maintenance in a cell, an increase in replicon 
replication, and an increase in replicon protein expression. 
5 Adaptive mutations are described herein by identifying the location of 

the adaptive mutation with respect to a reference sequence present in a particular 
region. Based on the provided reference sequence, the same adaptive mutation can be 
produced in corresponding locations of equivalent regions having an amino acid 
sequence different than the reference sequence. Equivalent regions have the same 

10 function or encode for a polypeptide having the same function. 

Replicon enhanced cells are a preferred host for the insertion and 
expression of an HCV replicon. Replicon enhanced cells are initially produced by 
creating a cell containing a HCV replicon and then curing the cell of the replicon. 
The term "replicon enhanced cell" includes cells cured of HCV replicons and progeny 

15 of such cells. 

Thus, a first aspect of the present invention describes a nucleic acid 
molecule comprising at least one of the following regions: an altered NS3 encoding 
region, an altered NS5A encoding region, and an altered EMCV IRES region. The 
altered region contains one or more adaptive mutations. Reference to the presence of 

20 particular adaptive mutation(s) does not exclude other mutations or adaptive 

mutations from being present. Adaptive mutations are described with reference to 
either an encoded amino acid sequence or a nucleic acid sequence. 

A nucleic acid molecule can be single-stranded or part of a double 
strand, and can be RNA or DNA. Depending upon the structure of the nucleic acid 

25 molecule, the molecule may be used as a replicon or in the production of a replicon. 
For example, single-stranded RNA having the proper regions can be a replicon, while 
double-stranded DNA that includes the complement of a sequence coding for a 
replicon or replicon intermediate may useful in the production of the replicon or 
replicon intermediate. 

30 Preferred nucleic acid molecules are those containing region(s) from 

SEQ. ED. NOs. 1, 2, or 3, or the RNA version thereof, with one or more adaptive 
mutations. Reference to "the RNA version thereof indicates a ribose backbone and 
the presence of uracil instead of thymine. 

The presence of a region containing an adaptive mutation indicates that 

35 at least one such region is present. In different embodiments, for example, adaptive 
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mutations described herein are present at least in the NS3 region, in the NS5A region, 
in the NS3 and NS5A regions, in the EMCV IRES and NS3 regions, in the EMCV 
and NS5A regions, and in the ECMV IRES, NS3 and NS5A regions. 

Another aspect of the present invention describes an expression vector 
5 comprising a nucleotide sequence of an HCV replicon or replicon intermediate 

coupled to an exogenous promoter. Reference to a nucleotide sequence "coupled to 
an exogenous promoter" indicates the presence and positioning of an RNA promoter 
such that it can mediate transcription of the nucleotide sequence and that the promoter 
is not naturally associated with the nucleotide sequence being transcribed. The 

10 expression vector can be used to produce RNA replicons. 

Another aspect of the present invention describes a recombinant 
human hepatoma cell. Reference to a recombinant cell includes an initially produced 
cell and progeny thereof. 

Another aspect of the present invention describes a method of making 

15 a HCV replicon enhanced cell. The method involves the steps of: (a) introducing and 
maintaining an HCV replicon into a cell and (b) curing the cell of the HCV replicon. 

Another aspect of the present invention describes an HCV replicon 
enhanced cell made by a process comprising the steps of: (a) introducing and 
maintaining an HCV replicon into a cell and (b) curing the cell of the HCV replicon. 

20 Another aspect of the present invention describes a method of making 

a HCV replicon enhanced cell comprising an HCV replicon. The method involves (a) 
introducing and maintaining a first HCV replicon into a cell, (b) curing the cell of the 
replicon, and (c) introducing and maintaining a second replicon into the cured cell, 
where the second replicon may be the same or different as the first replicon. 

25 Another aspect of the present invention describes an HCV replicon 

enhanced cell containing a HCV replicon made by the process involving the step of 
introducing an HCV replicon into an HCV replicon enhanced cell. The HCV replicon 
1 introduced into the HCV replicon enhanced cell may be the same or different than the 
HCV replicon used to produce the HCV replicon enhanced cell. In a preferred 

30 embodiment, the HCV replicon introduced into an HCV replicon enhanced cell is the 
same replicon as was used to produce the enhanced cell. 

Another aspect of the present invention describes a method of 
measuring the ability of a compound to affect HCV activity using an HCV replicon 
comprising an adaptive mutation described herein. The method involves providing a 

35 compound to a cell comprising' the HCV replicon and measuring the ability of the 
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compound to affect one or more replicon activities as a measure of the effect on HCV 
activity. 

Another aspect of the present invention describes a method of 
measuring the ability of a compound to affect HCV activity using an HCV replicon 
5 enhanced cell that comprises an HCV replicon. The method involves providing a 
compound to the cell and measuring the ability of the compound to effect one or more 
replicon activities as a measure of the effect on HCV activity. 

Other features and advantages of the present invention are apparent 
from the additional descriptions provided herein including the different examples. 
10 The provided examples illustrate different components and methodology useful in 
practicing the present invention. The examples do not limit the claimed invention. 
Based on the present disclosure the skilled artisan can identify and employ other 
components and methodology useful for practicing the present invention. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A- 1 G illustrate the nucleic acid sequence for the 
pHCVNeo. 17 coding strand (SEQ. ID. NO. 3). The different regions of pHCVNeo. 17 
are provided as follows: 

1-341: HCV 5' non-translated region, drives translation of the core-neo fusion protein; 
20 342- 1181: Core-neo fusion protein, selectable marker, 

1 190-1800: Internal ribosome entry site of the encephalomyocarditis virus, drives 
translation of the HCV NS region; 

1801-7755: HCV polyprotein from non-structural protein 3 to non-structural protein 
5B; 

25 1801-3696: Non-structural protein 3 (NS3), HCV NS3 protease/helicase; 
3697-3858: Non-structural protein 4A (NS4A), NS3 protease cofactor, 
3859-4641: Non-stnictural protein 4B (NS4B); 
4642-5982: Non-structural protein 5A (NS5A); 

5983-7755: Non-structural protein 5B (NS5B); RNA-dependent RNA polymerase 
30 7759-7989: HCV 3 f non-translated region; and 

7990-10690 plasmid sequences comprising origin of replication, beta lactamase 
coding sequence, and T7 promoter. 
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DETAILED DESCRIPTION OF THE INVENTION 

HCV replicons and HCV replicon enhanced cells can be used to 
produce a cell culture providing detectable levels of HCV RNA and HCV protein. 
HCV replicons and HCV replicon enhanced hosts can both be obtained by selecting 
5 for the ability to maintain an HCV replicon in a cell. As illustrated in the examples 
provided below, adaptive mutations present in HCV replicons and host cells can both 
assist replicon maintenance in a cell. 

The detectable replication and expression of HCV RNA in a cell 
culture system has a variety of different uses including being used to study HCV 
10 replication and expression, to study HCV and host cell interactions, to produce HCV 
RNA, to produce HCV proteins, and to provide a system for measuring the ability of a 
compound to modulate one or more HCV activities. 

Preferred cells for use with a HCV replicon are Huh-7 cells and Huh-7 
derived cells. "Huh-7 derived cells" are cell produced starting with Huh-7 cells and 
15 introducing one or more phenotypic and/or genotypic modifications. 

Adaptive Mutations 
Adaptive mutations enhance the ability of an HCV replicon to be 
maintained and expressed in a host cell. Adaptive mutations can be initially selected 
20 for using a wild type HCV RNA construct or a mutated HCV replicon. Initial 

selection involves providing HCV replicons to cells and identifying clones containing 
a replicon. 

Nucleic acid sequences of identified HCV replicons can be determined 
using standard sequencing techniques. Comparing the sequence of input HCV 

25 constructs and selected constructs provides the location of mutations. The effect of 
particular mutation(s) can be measured by, for example, producing a construct to 
contain particular mutation(s) and measuring the effect of these mutation(s). Suitable 
control constructs for comparison purposes include wild type constructs and 
constructs previously evaluated. 

30 Adaptive mutations were predominantly found in the HCV NS3 and 

NS5A regions. With the exception of two silent mutations in NS5A and NS5B, 
consensus mutations occurring in the NS region resulted in changes to the deduced 
amino acid sequence. Noticeably, the amino acid changes occurred in residues that 
are conserved in all or a large number of natural HCV isolates. HCV sequences are 

35 well known in the art and can be found, for example, in GenBank. 
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Adaptive mutations described herein can be identified with respect to a 
reference sequence. The reference sequence provides the location of the adaptive 
mutation in, for example, the NS3 or NSSA RNA, cDNA, or amino acid sequence. 
The remainder of the sequence encodes for a functional protein that may have the 
S same, or a different, sequence than the reference sequence. 

Preferred NS3 and NSSA adaptive mutations and examples of changes 
that can be made to produce such mutations are shown in Tables 1 and 2. The amino 
acid numbering shown in Tables 1 and 2 is with respect to SEQ. ID. NO. 1. The 
nucleotide numbering shown in Tables 1 and 2 is with respect to SEQ. ID. NO. 2. 
10 SEQ. ID. NO. 1 provides the amino acid sequence of the Conl HCV isolate 

(Accession Number AJ238799). SEQ. ID. NO. 2 provides the nucleic acid sequence 
of the Conl HCV isolate. 

TABLE 1 

15 



Preferred NS3 Ac 


aptive Mutations 


Amino Acid 


Nucleotide 


gly!095ala 


G3625C 


glul202gly 


A3946G 


alal347thr 


G4380A 



TABLE 2 



Preferred NSSA Adaptive Mutations 


Amino Acid 


Nucleotide 


Lys@2039 


AAA@6458 


asn2041thr 


A6463C 


ser2173phe 


C6859T 


ser2197phe 


C6931T 


Ieu2198ser 


T6934C 


ala2199thr 


G6936A 


ser2204arg 


C6953A(orG) 



@" refers to an addition. 



7 



WO 02/059321 



PCT/EP02/00526 



Preferred adaptive mutations identified with respect to a reference 
sequence can be produced changing the encoding region of SEQ. ID. NO. 1, or an 
equivalent sequence, to result in the indicated change. Preferred adaptive mutations 
5 provided in Tables 1 and 2 occur in amino acids conserved among different HCV 
isolates. 

Adaptive mutations have different effects. Some mutations alone, or 

in combination with other mutations, enhance HCV replicon activity. In some cases, 

two or more mutations led to synergistic effects and in one case, a slightly 
10 antagonistic effect was observed. 

An adaptive mutation once identified can be introduced into a starting 

construct using standard genetic techniques. Examples of such techniques are 

provided by Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987 

1998, and Sambrook, et aL 9 Molecular Cloning, A Laboratory Manual, 2 nd Edition, 
15 Cold Spring Harbor Laboratory Press, 1989. 

HCV replicons containing adaptive mutations can be built around an 

NS3 region or NS5A region containing one or more adaptive mutations described 

herein. The final replicon will contain replicon components needed for replication 

and may contain additional components. 
20 SEQ. ID. NO. 2 can be used as a reference point for different HCV 

regions as follows: 

5' UTR- nucleotides 1-341; 

Core- nucleotides 342-914; 

El- nucleotides 915-1490; 
25 E2- nucleotides 1491-2579; 

P7- nucleotides 2580-2768; 

NS2- nucleotides 2769-3419; 

NS3- nucleotides 3420-5312; 

NS4A- nucleotides 5313-5474; 
30 NS4B- nucleotides 5475-6257; 

NS5A- nucleotides 6258-7598; 

NS5B- nucleotides 7599-9371; and 

3' UTR- nucleotides 9374-9605. 

The amino acid sequences of the different structural and non-structural regions is 
35 provided by SEQ. ID. NO. 1 . 
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Nucleic acid sequences encoding for a particular amino acid can be 
produced taking into account the degeneracy of the genetic code. The degeneracy of 
the genetic code arises because almost all amino acids are encoded for by different 
combinations of nucleotide triplets or "codons". The translation of a particular codon 
5 into a particular amino acid is well known in the art (see, e.g., Lewin GENES /V, p. 
1 19, Oxford University Press, 1990). Amino acids are encoded for by RNA codons as 
follows: 

A=Ala=Alanine: codons GCA, GCC, GCG, GCU 

C=Cys=Cysteine: codons UGC, UGU 
10 D=Asp=Aspartic acid: codons GAC, GAU 

E=Glu=Glutamic acid: codons GAA, GAG 

B=Phe=Phenylalanine: codons UUC, UUU 

G=Gly=Glycine: codons GGA, GGC, GGG, GGU 

H=His=Histidine: codons CAC, CAU 
15 I=Ile=Isoleucine: codons AUA, AUC, AUU 

K=Lys=Lysine: codons AAA, AAG 

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU 
M=Met=Methionine: codon AUG 
N=Asn=Asparagine: codons AAC, AAU 
20 P=Pro=Proline: codons CCA, CCC, CCG, CCU 
Q=Gln=Glutamine: codons CAA, CAG 

R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU 

S=Sen=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU 

T=Thr=Threonine: codons ACA, ACC, ACG, ACU 
25 V=Val=Valine: codons GUA, GUC, GUG, GUU 

W=Trp=Tryptophan: codon UGG 

Y=Tyr^=Tyrosine: codons UAC, UAU. 

Constructs, including subgenomic and genomic replicons, containing 

one or more of the adaptive mutations described herein can also contain additional 
30 mutations. The additional mutations may be adaptive mutations and mutations not 

substantially inhibiting replicon activity. Mutations not substantially inhibiting 

replicon activity provide for a replicon that can be introduced into a cell and have 

detectable activity. 
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HCV Replicon 

HCV replicons include the full length HCV genome and subgenomic 
constructs. A basic HCV replicon is a subgenomic construct containing an HCV 5' 
UTR- PC region, an HCV NS3-NS5B polyprotein encoding region, and a HCV 3' 
5 UTR. Other nucleic acid regions can be present such as those providing for HCV 
NS2, structural HCV protein(s) and non-HCV sequences. 

The HCV 5* UTR-PC region provides an internal ribosome entry site 
(IRES) for protein translation and elements needed for replication. The HCV 5*UTR- 
PC region includes naturally occurring HCV 5' UTR extending about 36 nucleotides 
10 into a HCV core encoding region, and functional derivatives thereof. The 5'-UTR-PC 
region can be present in different locations such as site downstream from a sequence 
encoding a selection protein, a reporter, protein, or an HCV polyprotein. 

Functional derivatives of the S'-UTR-PC region able to initiate 
translation and assist replication can be designed taking into structural requirements 
15 for HCV translation initiation. (See, for example, Honda, et al., 1996. Virology 222, 
31-42). The affect of different modifications to a 5' UTR-PC region can be 
determined using techniques that measure replicon activity. 

In addition to the HCV 5' UTR-PC region, non-HCV IRES elements 
can also be present in the replicon. The non-HCV IRES elements can be present in 
20 different locations including immediately upstream the region encoding for an HCV 
polyprotein. Examples of non-HCV IRES elements that can be used are the EMCV 
IRES, poliovirus IRES, and bovine viral diarrhea virus IRES. 

The HCV 3' UTR assists HCV replication. HCV 3' UTR includes 
naturally occurring HCV 3' UTR and functional derivatives thereof. Naturally 
25 occurring 3' UTR's include a poly U tract and an additional region of about 100 

nucleotides. (Tanaka, et al., 1996. J. Virol. 70, 3307-3312, Kolykhalov, et al, 1996. 
J. Virol. 70, 3363-3371.) At least in vivo, the 3* UTR appears to be essential for 
replication. (Kolykhalov, et al, 2000. J. Virol. 2000 4, 2046-2051.) Examples of 
naturally occurring 3' UTR derivatives are described by Bartenschlager International 
30 Publication Number EP 1 043 399. 

The NS3-NS5B polyprotein encoding region provides for a polyprotein 
that can be processed in a cell into different proteins. Suitable NS3-NS5B polyprotein 
sequences that may be part of a replicon include those present in different HCV 
strains and functional equivalents thereof resulting in the processing of NS3-NS5B to 
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a produce functional replication machinery. Proper processing can be measured for 
by assaying, for example, NS5B RNA dependent RNA polymerase. 

The ability of an NS5B protein to provide RNA polymerase activity 
can be measured using techniques well known in the art. (See, for example, De 
5 Franscesco, et al 9 International Publication Number WO 96/37619, Behrens, et al t 
1996. £MflO 75:12-22, Lohmann,*/aZ., 1998. Virology 249:108-118.) Preferably, 
the sequence of the active NS5B is substantially similar as that provided in SEQ. ID. 
NO. 1, or a wild type NS5B such as strains HCV-1, HCV-2, HCV-BK, HCV-J, HCV- 
N, HCV-H. A substantially similar sequence provides detectable HCV polymerase 
10 activity and contains 1, 2, 3, 4, 5, 6, 7, 9, 10, 1 1, 12, 13, 14, or 15 amino acid 

alterations to that present in a HCV NS5B polymerase. Preferably, no more than 1, 2, 
3, 4 or 5 alterations are present 

Alterations to an amino acid sequence provide for substitution(s), 
insertion(s), deletion(s) or a combination thereof. Sites of different alterations can be 
IS designed taking into account the amino acid sequences of different NS5B polymerases 
to identify conserved and variable amino acid, and can be empirically determined. 

HCV replicons can be produced in a wide variety of different cells and 
in vitro. Suitable cells allow for the transcription of a nucleic acid encoding for an 
HCV replicon. 

20 

Additional Sequences 
An HCV replicon may contain non-HCV sequences in addition to 
HCV sequences. The additional sequences should not prevent replication and 
expression, and preferably serve a useful function. Sequences that can be used to 
25 serve a useful function include a selection sequence, a reporter sequence, transcription 
elements and translation elements. 

Selection Sequence 

A selection sequence in an HCV replicon facilitates the identification 
30 of a cell containing the replicon. Selection sequences are typically used in 

conjunction with some selective pressure that inhibits growth of cells not containing 
the selection sequence. Examples of selection sequences include sequences encoding 
for antibiotic resistance and ribozymes. 

Antibiotic resistance can be used in conjunction with an antibiotic to 
35 select for cells containing replicons. Examples of selection sequences providing for 
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antibiotic resistance are sequences encoding resistance to neomycin, hygromycin, 
puromycin, or zeocin. 

A ribozyme serving as a selection sequence can be used in conjunction 
with an inhibitory nucleic acid molecule that prevents cellular growth. The ribozyme 
5 recognizes and cleaves the inhibitory nucleic acid. 

Reporter Sequence 

A reporter sequence can be used to detect replicon replication or 
protein expression. Preferred reporter proteins are enzymatic proteins whose presence 

10 can be detected by measuring product produced by the protein. Examples of reporter 
proteins include, iuciferase, beta-lac tarn ase, secretory alkaline phosphatase, beta- 
glucuronidase, green fluorescent protein and its derivatives. In addition, a reporter 
nucleic acid sequence can be used to provide a reference sequence that can be targeted 
by a complementary nucleic acid. Hybridization of the complementary nucleic acid to 

15 its target can be determined using standard techniques. 

Additional Sequence Configuration 

Additional non-HCV sequences are preferable 5* or 3' of an HCV 
replicon genome or subgenomic genome region. However, the additional sequences 
20 can be located within an HCV genome as long as the sequences do not prevent 

detectable replicon activity. If desired, additional sequences can be separated from 
the replicon by using a ribozyme recognition sequence in conjunction with a 
ribozyme. 

Additional sequences can be part of the same cistron as the HCV 
25 polyprotein or can be a separate cistron. If part of the same cistron, the selection or 
reporter sequence coding for a protein should result in a product that is either active as 
a chimeric protein or is cleaved inside a cell so it is separated from HCV protein. 

Selection and reporter sequences encoding for a protein when present 
as a separate cistron should be associated with elements needed for translation. Such 
30 elements include a 5* IRES. 

Detection Methods 
Methods for detecting replicon activity include those measuring the 
production or activity of replicon RNA and encoded for protein. Measuring includes 
35 qualitative and quantitative analysis. 
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Techniques suitable for measuring RNA production include those 
detecting the presence or activity of RNA. The presence of RNA can be detected 
using, for example, complementary hybridization probes or quantitative PGR. 
Techniques for measuring hybridization between complementary nucleic acid and 
S quantitative PCR are well known in the art. (See for example, Ausubel, Current 

Protocols in Molecular Biology, John Wiley, 1987-1998, Sambrook, et a/., Molecular 
Cloning, A Laboratory Manual, 2 nd Edition, Cold Spring Harbor Laboratory Press, 
1989, and U.S. Patent No. 5,731,148.) 

RNA enzymatic activity can be provided to the replicon by using a 

10 ribozyme sequence. Ribozyme activity can be measured using techniques detecting 
the ability of the ribozyme to cleave a target sequence. 

Techniques for measuring protein production include those detecting 
the presence or activity of a produced protein. The presence of a particular protein 
can be determined by, for example, immunological techniques. Protein activity can 

15 be measured based on the activity of an HCV protein or a reporter protein sequence. 

Techniques for measuring HCV protein activity vary depending upon 
the protein that is measured. Techniques for measuring the activity of different non- 
structural proteins such as NS2/3, NS3, and NS5B, are well known in the art. (See, 
for example, references provided in the Background of the Invention.) 

20 Assays measuring replicon activity also include those detecting virion 

production from a replicon that produces a virion; and those detecting a cytopathic 
effect from a replicon producing proteins exerting such an effect. Cytopathic effects 
can be detected by assays suitable to measure cell viability. 

Assays measuring replicon activity can be used to evaluate the ability 

25 of a compound to modulate HCV activities. Such assays can be carried out by 
providing one or more test compounds to a cell expressing an HCV replicon and 
measuring the effect of the compound on replicon activity. If a preparation containing 
more than one compound is found to modulate replicon activity, individual 
compounds or smaller groups of compounds can be tested to identify replicon active 

30 compounds. 

Compounds identified as inhibiting HCV activity can be used to 
produce replicon enhanced cells and may be therapeutic compounds. The ability of a 
compound to serve as a therapeutic compound can be confirmed using animal models 
such as a chimpanzee to measure efficacy and toxicity. 

35 
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Replicon Enhanced Host Cell 
Replicon enhanced cells are initially produced by selecting for a cell 
able to maintain an HCV replicon and then curing the cell of the replicon. Cells 
produced in this fashion were found to have an increased ability to maintain a replicon 
5 upon subsequent HCV replicon transfection. 

Initial transfection can be performed using a wild-type replicon or a 
replicon containing one or more adaptive mutations. If a wild-type replicon is 
employed, the replicon should contain a selection sequence to facilitate replicon 
maintenance. 

10 Cells can be cured of replicons using different techniques such as those 

employing replicon inhibitory agent. In addition, replication of HCV replicons is 
substantially reduced in confluent cells. Thus, it is conceivable to cure cells of 
replicons by culturing them at a high density. 

Replicon inhibitory agents inhibit replicon activity or select against a 

15 cell containing a replicon. An example of such an agent is IFN-a. Other HCV 
inhibitory compounds may also be employed. HCV inhibitor compounds are 
described, for example, in Llinas-Brunet, et ai, 2000. Bioorg Med Chem. Lett 10(20), 
2267-2270. 

The ability of a cured cell to be a replicon enhanced cell can be 
20 measured by introducing a replicon into the cell and determining efficiency of 
subsequent replicon maintenance and activity. 

EXAMPLES 

Examples are provided below to further illustrate different features of 
25 the present invention. The examples also illustrate useful methodology for practicing 
the invention. These examples do not limit the claimed invention. 

Example 1: Techniques 

This example illustrates the techniques employed for producing and 
30 analyzing adaptive mutations and replicon enhanced cells. 

Manipulation of Nucleic Acids and Construction of Recombinant Plasmids 

Manipulation of nucleic acids was done according to standard 
protocols. (Sambrook, et aU 1989. Molecular Cloning: A Laboratory Manual, 2 nd ed. 
35 Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.) Plasmid DNA was 
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prepared from ON culture in LB broth using Qiagen 500 columns according to 

manufacturer instructions. 

Plasmids containing desired mutations were constructed by restriction 

digestion using restriction sites flanking the mutations or by PCR amplification of the 
5 area of interest, using synthetic oligonucleotides with the appropriate sequence. Site 

directed mutagenesis was carried out by inserting the mutations in the PCR primers. 

PCR amplification was performed using high fidelity thermostable polymerases or 

mixtures of polymerases containing a proofreading enzyme. (Barnes, et al. p 1994. 

Proc, Natl Acad. Sci. 91 , 2216-2220.) All plasmids were verified by restriction 
10 mapping and sequencing. 

pHCVneol7.wt contains the cDNA for an HCV bicistronic replicon 

identical to replicon I 3 77neo/NS3-37wt described by Bartenschlager (SEQ. ID. NO. 3) 

(Lohmann, et al. 9 1999. Science 285,1 10-1 13, EMBI^genbank No. AJ242652). The 

plasmid comprises the following elements: 5* untranslated region of HCV comprising 
15 the HCV-IRES and part of the core (ntl-377); neomycin phosphotransferase coding 

sequence; and EMCV IRES; HCV coding sequences from NS3 to NS5B; 3' UTR of 

HCV. 

Plasmid pHCVNeol?.GAA is identical to pHCVNeo.17, except that 
the GAC triplets (nt. 6934-6939 of pHCVNeol7 sequence) coding for the catalytic 
20 aspartates of the NS5B polymerase (amino acids 2737 and 2738 of HCV polyprotein) 
were changed into GCG, coding for alanine. 

Plasmid pHCVNeol7.m0 is identical to pHCVNeol7, except that the 
triplet AGC (nt. 5335-5337 of pHCVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2204 of HCV polyprotein) was changed into AGA, coding for 
25 arginine. 

Plasmid pHCVNeol7.ml is identical to pHCVNeol7, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine. 

30 Plasmid pHCVNeol7.m2 is identical to pHCVNeol7, except that the 

triplet TCC (nt. 5242-5244 of pHCVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2173 of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. 
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Plasmid pHCVNeol7.m3 is identical to pHCVNeoH, except that the 
triplet TCC (nt. 5314-5316 of pHCVNeoH sequence) coding for the serine of NS5A 
protein (amino acid 2197 of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. 

5 Plasmid pHCVNeol7.m4 is identical to pHCVNeol7, except that the 

triplet TTG (nt. 5317-5319 of pHCVNeoH sequence) coding for the leucine of NS5 A 
protein (amino acid 2198 of HCV polyprotein) was changed into TCG, coding for 
serine. 

Plasmid pHCVNeol7.m5 is identical to pHCVNeoH, except that an 
10 extra triplet AAA coding for lysine was inserted after the triplet GTG (nt. 4840-4843 
of pHCVNeoH sequence), coding for valine 2039 of HCV polyprotein. 

Plasmid pHCVNeol7.m6 is identical to pHCVNeoH, except that the 
triplets GAA and GCC (nt. 2329-2331 and 2764-2766 of pHCVNeoH sequence) 
coding for the glutamic acid and the alanine of NS3 protein (amino acid 1202 and 
15 1347 of HCV polyprotein) were changed respectively into GGA and ACC, coding for 
glycine and threonine. The triplet TCC (nt. 5242-5244 of pHCVNeoH sequence) 
coding for the serine of NS5A protein (amino acid 2173 of HCV polyprotein) was 
changed into TTC, coding for phenylalanine; an extra adenosine was inserted into the 
EMCV IRES (after the thymidine 1736 of the neplicon sequence). 
20 Plasmid pHCVNeol7.m7 is identical to pHCVNeoH, except that the 

triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5242-5244 of pHCVNeoH sequence) coding for 
the serine of NS5A protein (amino acid 2173 of HCV polyprotein) was changed into 
25 TTC, coding for phenylalanine. 

Plasmid pHCVNeo!7.m8 is identical to pHCVNeoH, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeoH sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5314-5316 of pHCVNeoH sequence) coding for 
30 the serine of NS5A protein (amino acid 2197 of HCV polyprotein) was changed into 
TTC, coding for phenylalanine. 

Plasmid pHCVNeoI7.m9 is identical to pHCVNeoH, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
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for threonine; the triplet TTG (nt. 5317-5319 of pHCVNeol7 sequence) coding for 
the leucine of NS5A protein (amino acid 2198 of HCV polyprotein) was changed into 
TCG, coding for serine. 

Plasmid pHCVNeolZmlO is identical to pHCVNeol7, except that the 
5 triplet GAA (nt 2329-233 1 of pHC VNeol7 sequence) coding for the glutamic acid of 
NS3 protein (amino acid 1202 of HCV polyprotein) was changed into GGA, coding 
for glycine; an extra triplet AAA coding for lysine was inserted after the triplet GTG 
(nt. 4840-4843 of pHCVNeol7 sequence), coding for valine 2039 of HCV 
polyprotein. 

10 Plasmid pHCVNeol7.ml 1 is identical to pHCVNeol7, except that the 

triplet TCC (nt. 5314-5316 of pHCVNeol7 sequence) coding for the serine of NS5A 
protein (amino acid 2197 of HCV polyprotein) was changed into TTC, coding for 
phenylalanine. The triplet GCC (nt. 5320-5322 of pHCVNeol7 sequence) coding for 
the alanine of NS5A protein (amino acid 2199 of HCV polyprotein) was changed into 

1 5 ACC coding for threonine. 

Plasmid pHCVNeol7.ml2 is identical to pHCVNeol7, except that the 
triplet AAC (nt. 4846-4848 of pHCVNeol7 sequence) coding for the asparagine of 
NS5 A protein (amino acid 2041 of HCV polyprotein) was changed into ACC, coding 
for threonine; the triplet TCC (nt. 5314-5316 of pHCVNeol7 sequence) coding for 

20 the serine of NS5A protein (amino acid 2197 of HCV polyprotein) was changed into 
TTC, coding for phenylalanine. The triplet GCC (nt. 5320-5322 of pHCVNeol7 
sequence) coding for the alanine of NS5A protein (amino acid 2199 of HCV 
polyprotein) was changed into ACC coding for threonine. 

Plasmid pHCVNeol7.ml3 has the same mutations as 

25 pHC VNeol7.m8, but also an extra adenosine inserted into the EMCV IRES (after the 
thymidine 1736 of the replicon sequence). 

Plasmid pHCVNeol7.ml4 has the same mutations as 
pHCVNeol7.ml 1, but also an extra adenosine inserted into the EMCV IRES (after 
the thymidine 1736 of the replicon sequence). 

30 Plasmid pHCVNeol7.ml5 is identical to pHCVNeol7, except that the 

triplet GCC (nt. 5320-5322 of pHCVNeol7 sequence) coding for the alanine of 
NS5A protein (amino acid 2199 of HCV polyprotein) was changed into ACC coding 
for threonine. 
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Plasmid pRBSEAP.5 is a pHCVNeo.17 derivative where the Neo 
coding sequence has been replaced with the sequence coding for the human placental 
alkaline phosphatase corresponding to nucleotides 90-1580 of pBC12/RSV/SEAP 
plasmid. (Berger, etal., 1988. Gene 66, 1-10.) 

5 

RNA Transfection 

Transfection was performed using Huh-7 cells. The cells were grown 
in Dulbecco's modified minimal essential medium (DMEM, Gibco, BRL) 
supplemented with 10% FCS. For routine work, cells were passed 1 to 5 twice a 

10 week using Ix trypsin/EDTA (Gibco, BRL). 

Plasmids were digested with the Seal endonuclease (New England 
Biolabs) and transcribed in vitro with the T7 Megascript kit (Ambion). Transcription 
mixtures were treated with DNase I (0. 1 U/ml) for 30 minutes at 37°C to completely 
remove template DNA, extracted according to the procedure of Chomczynski 

15 (Chomczynski, et al. t 1987. Anal Biochenu 162, 156-159), and resuspended with 
RNase-free phosphate buffered saline (rfPBS, Sambrook, et al, 1989. Molecular 
Cloning: A Laboratory Manual, 2 nd ed. Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N. Y.). 

RNA transfection was performed as described by Liljestrom, et al. t 
20 1991. J. Virol 6, 4107-41 13, with minor modifications. Subconfluent, actively 
growing cells were detached from the tissue culture container using trypsin/EDTA. 
Trypsin was neutralised by addition of 3 to 10 volumes of DMEM/10%FCS and cells 
were centrifuged for 5 minutes at 1200 rpm in a Haereus table top centrifuge at 4<>C. 
Cells were resuspended with ice cold rfPBS by gentle pipetting, counted with a 
25 haemocitometer, and centrifuged as above. rfPBS wash was repeated once and cells 
were resuspended at a concentration of 1-2 x 10 7 cell/ml in rfPBS. Aliquots of cell 
suspension were mixed with RNA in sterile eppendorf tubes. The RNA/cell mixture 
was immediately transferred into the electroporation cuvette (precooled on ice) and 
pulsed twice with a gene pulser apparatus equipped with pulse controller (Biorad). 
30 Depending on the experiment, 0. 1 , 0.2 or 0.4 cm electrode gap cuvettes were used, 
and settings adjusted (Table 3). 
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TABLE 3 



Cuvette 


Volume 


Voltage 


Capacitance 


Resistance 


RNA 


Rap (cm) 


(Hi) 


(Volts) 


(uFa) 


(ohm) 




0.1 


70 


200 


25 


infinite 


1-10 


0.2 


200 


400 


25 


infinite 


5-20 


0.4 


800 


800 


25 


infinite 


15-100 



After the electric shock, cells were left at room temperature for 1-10 
5 minutes (essentially the time required to electroporate all samples) and subsequently 
diluted with at least 20 volumes of DMEM/10%FCS and plated as required for the 
experiment. Survival and transfection efficiency were monitored by measuring the 
neutral red uptake of cell cultured for various days in the absence or in the presence of 
neomycin sulfate (G418). With these parameters, survival of Huh-7 cells was usually 
10 40-60% and transfection efficiency ranged between 40% and 100%. 

Sequence Analysis of Replicon RNAs 

The entire NS region was recloned from 3 different transfection 
experiments performed with HCVNeo.17 RNA. RNA was extracted from selected 
15 clones either using the Qiagen RNAeasy minikit following manufacturer instructions 
or as described by Chomczynski, et a/., 1987. Anal. Biochem. 162, 156-159. 

Replicon RNAs (5 jig of total cellular RNA) were retro-transcribed 
using oligonucleotide HCVG34 (5'- AC ATG ATCTGC AG AG AGGCC AGT-3 * ; SEQ. 
ID. No. 4) and the Superscript II reverse transcriptase (Gibco, BRL) according to 
20 manufacturer instructions, and subsequently digested with 2 U/ml Ribonuclease H 

(Gibco BRL). The cDNA regions spanning from the EMCV IRES to the HCV 3' end 
were amplified by PCR using oligonucleotides HCVG39 (5'- 
G AC ASGCTGTG AT AW ATGTCTCCCCC-3 9 ; SEQ. ID. NO. 5) and CITE3 (5*- 
TGGCTCTCCTCAAGCGTATTC -3'; SEQ. ID. NO. 6) and the LA Taq DNA 
25 polymerase (Takara LA Taq). 

Amplified cDNAs were digested with the Kpnl endonuclease (New 
England Biolabs) and the 5.8 kb fragments were gel purified and ligated to the 5.6 kb 
vector fragment (purified from plasmid pRBSEAP.5 digested with Kpnl) using T4 
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DNA ligase (New England Biolabs) according to manufacturer instructions. Ligated 
DNAs were transformed by electroporation in DH10B or JM1 19 strains of E. coli. 

In the case of NS5A region, total RNA isolated from 3 clones, (HB77, 
HB60 and HB68) was extracted and used for RT-PCR. 5/ig of total RNA plus 20 
5 pmole of AS61 oligo (5 • - ACTCTCTGC AGTC AAGCGGCTC A-3 * , RT antisense 
oligo; SEQ. ID. NO. 7) were heated 5 minutes at 95°C, then DMSO (5% f.c.), DTT 
(10 mM f.c), 1 mM dNTP (1 mM f.c), Ix Superscript buffer (1 x f.c), and 10 u 
Superscript (Gibco) were added to a total volume of 20 /il and incubated 3 hours at 
42°C. 2/il of this RT reaction were used to perform PCR with oligos S39 (5*- 

10 CAGTGGATGAACCGGCTGATA-3\ sense; SEQ. ID. NO. 8) or S41 (5'- 

GGGGCGACGGCATCATGCAAACC-3% sense; SEQ. ID. NO. 9) and B43 (5*- 
CAGGACCTGCAGTCTGTCAAAGG-3\ antisense; SEQ. ID. NO. 10) using 
Elongase Enzyme Mix (Gibco) according the instruction provided by the 
manufacturer. The resulting PCR fragment was cloned in pCR2.1 vector using the 

15 TA Cloning kit (Invitrogen) and transformed in ToplOF bacterial strain. 

Plasmid DNA was prepared from ON culture of the resulting 
ampicillin resistant colonies using Qiagen 500 columns according to manufacturer 
instructions. The presence of the desired DNA insert was ascertained by restriction 
digestion, and the nucleotide sequence of each plasmid was determined by automated 

20 sequencing. Nucleotide sequences and deduced amino acids sequences were aligned 
using the GCG software. 

TaqMan 

TaqMan analysis was typically performed using 10 ng of RNA in a 
25 reaction mix (TaqMan Gold RT-PCR kit, Perkin Elmer Biosystems) either with HCV 
specific oligos/probe (oligo 1: 5'-CGGGAGAGCCATAGTGG-3'; SEQ. ID. NO. 11, 
oligo 2: S'-AGTACCACAAGGCCTTrCG-S'; SEQ. ID. NO. 12, probe: 5'- 
CTGCGG A ACCGGTGAGTACAC-3' ; SEQ. ID. NO. 13) or with human GAPDH 
specific oligos/probe (Pre-Developed TaqMan Assay Reagents, Endogenous Control 
30 Human GAPDH, Part Number 43 10884E, Perkin Elmer Biosystems). PCR was 

performed using a Perkin Elmer ABI PRISM 7700 under the following conditions: 30 
minutes at 48°C (the RT step), 10 minutes at 95°C and 40 cycles: 15 seconds at 95°C 
and 1 minute at 60°C. Quantitative calculations were obtained using the Comparative 
C T Method (described in User Bulletin #2, ABI PRISM 7700 Sequence Detection 
35 System, Applied Biosystem, Dec 1997) considering the level of GAPDH mRNA 
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constant. All calculations of HCV RNA are expressed as fold difference over a 
specific control. 

Antibodies and Immunological Techniques 
5 Mouse monoclonal antibody (anti-NS3 mablOE5/24) were produced 

by standard techniques. (Galfog and Milstein, 1981. Methods in Enzymology 73, 1- 
46.) Purified recombinant protein was used as an immunogen. (Gallinari, et aL y 
1999. Biochemistry 38, 5620-5632.) 

For Cell-ELISA analysis, transfected cells were monitored for 

10 expression of the NS3 protein by ELISA with the anti-NS3 mab 10E5/24. Cells were 
seeded into 96 well plates at densities of 40,000, 30,000, 15,000 and 10,000 cells per 
well and fixed with ice-cold isopropanol at 1, 2, 3 and 4 days post-transfection, 
respectively. The cells were washed twice with PBS, blocked with 5% non-fat dry 
milk in PBS + 0.1% Triton X100 + 0.02% SDS (PBSTS) and then incubated 

15 overnight at 4°C with 10E5/24 mab diluted 1:2000 in Milk/PBSTS. After washing 5 
times with PBSTS, the cells were incubated for 3 hours at room temperature with 
anti-mouse IgG Fc specific alkaline phosphatase conjugated secondary antibody 
(Sigma A-7434), diluted 1:2000 in Milk/PBSTS. After washing again as above, the 
reaction was developed with p-nitrophenyl phosphate disodium substrate (Sigma 104- 

20 105) and the absorbance at 405 nm read at intervals. 

The results were normalized by staining with sulforhodamine B (SRB 
Sigma S 1402) to determine cell numbers. The alkaline phosphatase substrate was 
removed from the wells and the cells washed with PBS. The plates were then 
incubated with 0.4% SRB in 1% acetic acid for 30 minutes (200 nl/well), rinsed 4 

25 times in 1% acetic acid, blotted dry and then 200 nl/well of lOmM Tris pH 10.5 
added. After mixing, the absorbance at 570 nm was read. 

Neutral Red/ Crystal Violet Staining of Foci 

The survival of transfected cells in the absence or presence of G418 

30 was monitored by staining of foci/clones with neutral red in vivo with subsequent 
crystal violet staining. The medium was removed from the cells and replaced with 
fresh medium containing 0.0025% neutral red (Sigma N2889) and the cells incubated 
for 3 hours at 37°C. Cells were washed twice with PBS, fixed in 3.5% formaldehyde 
for 15 minutes, washed twice again in PBS and then with distilled water and the 

35 number of (live) foci counted. The cells could then be re-stained with crystal violet 
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by incubating with an 0.1% crystal violet (Sigma C0775) solution in 20% methanol 
for 20 minutes at room temperature, followed by 3 washes in 20% methanol and a 
wash with distilled water. 

5 Preparation Of Cells Cured Of Endogenous Replicon 

Replicon enhanced cells designated 10IFN and C1.60/cu were produced 
using different HCV inhibitory agents. Based on the techniques described herein 
additional replicon enhanced clones can readily be obtained. 

10DFN was obtained by curing a Huh-7 cell of a replicon using human 

10 IFN-a2b. Huh-7 cells containing HCV replicons (designated HBI10, HBIII4, HBHI27 
and HBHI18) were cultured for 1 1 days in the presence of 100 U/ml recombinant 
human IFN-a2b (Intron-A, Schering-Plough), and subsequently for 4 days in the 
absence of EFN-oc2b. At several time points during this period, the clones were 
analyzed for the presence of HCV proteins and RNA by Western and Northern 

15 blotting. After 7 days of incubation with IFN-ot2b, HCV proteins could no longer be 
detected in any of these clones by Western blotting and similar effects were seen with 
RNA signals in Northern blots. IFN-cc2b treated cells were stored in liquid nitrogen 
until used for transfection experiments. 

C1.60/cu was obtained by curing a Huh-7 cell of a replicon using an 

20 HCV inhibitory compound. The presence of HCV RNA was determined using PCR 
(TaqMan) at 4, 9, 12 and 15 days. From day 9 the amount of HCV RNA was below 
the limit of detection. To further test the disappearance of the replicon, 4 million cells 
of cured Clone 60 cells (after the 15 days of treatment) were plated in the presence of 
1 mg/ml G-418. No viable cells were observed, confirming that absence of HCV 

25 replicons able to confer G-418 resistance. 

Example 2: Selection and Characterization of Cell Clones Containing Functional 
HCV Replicons 

Huh-7 cells (2-8xl0 6 ) were transfected by electroporation with in vitro 
30 transcribed replicon RNAs (10-20 ng), plated at a density ranging from 2.5xl0 3 to 
10xl0 3 /cm 2 , and cultured in the presence of 0.8-1 mg/ml G418. The majority of 
replicon transfected cells became transiently resistant to G418 and duplicated 
normally for 7 to 12 days in the presence of the drug, while cells transfected with 
irrelevant RNAs and mock transfected cells did not survive more than 7 days (data not 
35 shown). Transient resistance to G418 was likely due to persistence of the Neo protein 
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expressed from the transfected RNA, since it was observed also with mutated 
replicons unable to replicate. Approximately 2 weeks after transfection, transient 
resistance declined, most cells died and small colonies of cells permanently resistant 
to the antibiotic became visible in samples transfected with HCVNeo.17 RNA, but 
5 not in cells transfected with other replicon RNAs. 

In several experiments, the frequency of G418 resistant clones ranged 
between 10 and 100 clones per 10 6 transfected cells. About 20 G418 resistant 
colonies were isolated, expanded and molecularly characterized. PCR and RT-PCR 
analysis of nucleic acids indicated that all clones contained HCV RNA but not HCV 

10 DNA, demonstrating that G4 1 8 resistance was due to the presence of functional 

replicons (data not shown). This result was confirmed by Northern blot analysis and 
metabolic labeling with 3H-uridine, which revealed the presence of both genomic and 
antigenomic HCV RNAs of the expected size (data not shown). Lastly, western blot, 
immunoprecipitation and immunofluorescence experiments showed that these clones 

15 expressed all HCV non-structural proteins as well as Neo protein (data not shown). 

Clones differed in terms of cell morphology and growth rate. Replicon 
RNA copy number (500-10000 molecules/cell) and viral protein expression also 
varied between different clones (data not shown). However, the amount of replicon 
RNA and proteins also varied with passages and with culture conditions and was 

20 higher when cells were not allowed to reach confluency, suggesting that replicons 
replicated more efficiently in dividing cells than in resting cells. Processing of the 
viral polyprotein occurred with kinetics similar to those observed in transfected cells. 

Interestingly, in all tested clones HCV replication was efficiently 
inhibited by treating the cells with IFN-ct2b. The EC50 was between 1 and 10 U/ml, 

25 depending on the clone. 

Example 3: Identification of Adaptive Mutations 

The low number of G418 resistant clones derived from HCVNeo.17 
RNA transfection suggested that replication could require mutation(s) capable of 
30 adapting the replicon to the host cell (adaptive mutations) and/or that only a small 
percentage of Huh-7 cells were competent for HCV replication. To verify the first 
hypothesis, mutations in replicons RNAs derived from selected cell clones were 
identified. 



23 



WO 02/059321 



PCT/EP02/00526 



RNA sequences for different replicons were determined using standard 
techniques. Such techniques involved isolating RNA from several independent 
clones, reverse transcription to produce cDNA, amplifying cDNAs by PCR and 
cloning into an appropriate vector. The cDNA spanning almost the entire HCV NS 
5 region (126 bp at the 3* end of the EMCV IRES and 5650 bp of the HCV NS region 
(Le., the entire NS ORF and 298 nucleotides at the 3' end) from 5 clones (HBI10, 
HBIII12, HBHI18, HBDI27, HBIVl) were recloned and sequenced. In addition, the 
NS5A coding region (nt. 4784-6162) from 3 additional clones (HB 77, HB 68 and HB 
60) were recloned and sequenced 
10 To discriminate mutations present in the replicon RNA from those 

derived from the cloning procedure, at least 2 isolates derived from independent RT- 
PCR experiments were sequenced for each cell clone. Comparison of the nucleotide 
sequences with the parental sequence indicated that each isolate contained several 
mutations (Tables 4A and 4B). 

15 

TABLE 4A 



Cell clone 


HBIII 12 


HBIII 18 


HBI 10 


HBIII 27 


isolate 


4 


29 


28 


61 


I 12 


43 


13 


72 




1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674- 
7460 


1674-7460 


1674-7460 


EMCV 
IRES 
126 bp 


A @ 1736 


A @ 1736 




C 1752 T 








T 1678 C 


NS3 
1895 bp 


G2009C 

A 2698 G 
G2764A 

A 3256 G 
T3273 C 


A2330G 
C2505T 
G 2764 A 
T3085C 


T2150C 
C 2196 A 
T 3023 A 
T3134C 
C3267T 


T2015C 

A 2338 G 
C2616T 
A2664G 
A 3148 G 
T 3286 C 
C3615T 
C3657T 


T 1811 A 
A 2330 G 
T2666C 
T3395C 


A 2330 G 
A 2882 G 
T3673C 


G2009C 
T2015C 

C2336G 
A 3130 T 
A 3401 G 
A 3518 C 


G2009C 

C 2052 A 
G2644A 
C 2803 A 
T 2823 A 
T 3692 C 


NS4A 
161 bp 


T3790C 




A3847G 


T 3827 A 


T 3742 C 




A 3743 G 


A 3797 G 


NS4B 
782 bp 


T3869C 
A 4107 G 
T4I85C 
A 4428 G 


C4283T 
C4429T 


G4300A 


A 4136 G 
A 426143 
G 4309 A 
A 4449 G 


T4290C 


A 4053 G 
A 2496 C 
T4316G 


G 3880 A 
T4200C 
A 4366 G 


C4547T 
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TABLE 4 A 



Cell clone 


HBIII 12 


HBIII 18 


HBI 10 


HBIII 27 


isolate 


4 


29 


28 


61 


12 


43 


A J 


/z 




1674- 


1674- 


1674- 


% dirt A 
16/4- 


16/4- 


1674- 


1674-7460 


1674-7460 




7/4/Cft 


1 *#ov 


746I1 
/*tow 


7460 


74A0 








NS5A 


A 45147 P 


CI 477R A 


P 5243 T 


C 4729 A 


A 4694 T 




A 4RSS n 

r\ HOJJ VJ 


/\ *fOOO VJ 


1340 bp 


VJ J 1 JO A\ 


/i tOtJ KM 


A 5486 G 

A\ JtOU \J 


T4993 C 


AAA O 


4 47A f CI 




r* 40R5 T 










4842 










G5175C 


C5243T 


C5596T 


G5095 A 


T5237C 


AAA @ 


T5318C 


T 5030 A 














4842 








C5243T 


G5512T 


C5«2JA 


7"53J4 C 




T5368C 


A 5574 G 


T5090A 




C5390T 


A 5521 G 




A 5374 T 






G 5866 A 


T5318C 




A 5719 G 


A5600G 




T 5379 A 








A 5328 G 






A5740C 




T5480C 








A 5399 G 










A 5513 G 








A5574G 










T5977C 










NS5B 


T6316C 


A6406G 


T6074C 


A 6150 G 


A6911G 


A 5986 G 


G6479C 


G 6156 A 


1477 bp 


T 6589 C 


G 6756 A 


A 6541 G 


A 6218 G 




T6099C 


C6870T 


G 7434 A 




T 7370 C 


G6963T 


A 6732 G 


T 7352 A 




C6J4J T 


A 7213 G 


T7444C 








A 7350 T 






G6463 A 


T7448C 










A 7359 G 






C6849T 


















T6865C 







Clone name and isolate number are indicated in the first and second row, respectively. 
The first and the last nucleotide of the region that was recloned and sequenced are indicated in the third 
5 row. 

Nucleotide (IUB code) substitutions are indicated with the original nucleotide, its position and mutated 
nucleotide. 

Nucleotide(s) insertions are indicated with the nucleotide(s), the symbol @ and the position of the 

nucleotide preceding insertion. 
10 Numbering refers to the first nucleotide of the replicon sequence (EMBL-genbank No. AJ242652). 

The region in which mutations are located and the nucleotide length of each region are indicated in the 

left most column. 

Silent mutations are in italic. 

Non sense mutations are underlined. 
1 5 Consensus mutations are bold. 



TABLE 4B 



Cell clone 


HBIV1 


HB77 


HB 68 


HB60 


isolate 


85 


93 


10 


14 


42 


1 


13 


7 




1674- 
7460 


1674- 
7460 


4784- 
6162 


4465- 
6162 


4784- 
6162 


4465- 
6162 


4784-6162 


4784-6162 


EMCV 
IRES 
126 bp 




A @ 1736 














NS3 
1895 bp 


A 3403 G 


A 2572 G 
A 3454 G 
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TABLE 4B 



Cell clone 


HB1V1 


HB77 


HB 68 


HB60 


isolate 


85 


93 


10 


14 


42 


! 1 


13 


1 




1674- 
7460 


1674- 
7460 


4784- 
6162 


4465- 
6162 


4784- 
6162 


4465- 
6162 


4784-6162 


4784-6162 


161 bp 


















NS4B 
782 bp 


A 4084 G 


C3892T 














NS5A 
1340 bp 


T4742C 
C5315T 
G5431T 
7*575/ C 
T5797C 


A4847C 

A 5225 G 
C5315T 
G 5320 A 
T 5356 A 
G 5523 A 
T 5888 A 


C4813T 
G5060C 
C5337A 


A4699C 
A5161G 
C 5337 A 
A 5459 G 
T5977C 


T5171G 
C5298T 
C5337A 
A 5639 G 
A5969G 


T4587C 
T4972C 
A 5094 G 
A 5278 G 
G 5320 A 
C 55327 


A 482 J G 
G 5320 A 
A 5414 G 
T5601G 
C5808T 


C5337G 
C5551T 
G 5806 A 


NS5B 
1477 bp 


T 6144 A 
A 6365 G 
A6656G 
A6677G 
T68S5C 
T 6947 A 
T6997C 
G7041T 
A 7187 C 


T6855C 
A 7135 G 
T7171C 















See Table 4A legend. 

The frequency of mutations ranged between 1.7 x 10" 3 and 4.5 x 10" 3 
5 (average 3 x 10" 3 ). The majority of mutations were nucleotide substitutions, although 
insertions of 1 or more nucleotides were also observed (Tables 4A and 4B). 

Approximately 85% of the mutations found only in 1 isolate (non- 
consensus) were randomly distributed in the recloned fragment, and possibly include 
mis-incorporation during the PCR amplifications. Conversely, the remaining 15% of 
1 0 the mutations were common to 2 or more isolates derived from independent RT-PCR 
experiments (consensus mutations), and presumably reflected mutations present in the 
template RNA. 

Consensus mutations were found in all isolates and were either 
common to isolates derived from the same clone (consensus A), or to isolates derived 
15 from different clones (consensus B). Analysis of additional isolates derived from the 
same cell clones indicated that consensus A mutations were not always present in all 
isolates derived from one clone (data not shown). This observation, together with the 
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presence of consensus B mutations, suggests that, even within a single cell clone, 
replicons exist as quasi-species of molecules with different sequences. 

At variance with non-consensus mutations, consensus mutations were 
not randomly distributed but were clustered in the regions coding for the NSSA 
5 protein (frequency 1 x 10* 3 ) and for the NS3 protein (frequency 0.5 x 10" 3 ). Only one 
consensus mutation was found in the region coding for the NSSB protein (frequency 
0, 1 x 10° nucleotides) and none in the regions coding for NS4 A and NS4B. 
Interestingly, 1 consensus mutation was observed also in the EMCV IRES. 

With the exception of 2 silent mutations found in NSSA and NSSB, 

10 consensus mutations occurring in the NS region resulted in changes in the deduced 
amino acid sequence (Tables 5A and SB). Noticeably, these amino acid changes 
occurred in residues that are conserved in all or most natural HCV isolates. 
Interestingly, clones HB 77 and HB 60 displayed different nucleotide substitutions 
(C5337A and C5337G, respectively) resulting in the same amino acidic mutation (S 

15 2204 R). 



TABLE 5 A 



Cell clone 


HBI11 12 


HBHI 18 


HBI 10 


HB111 27 


isolate 


4 


29 


28 


61 


12 


43 


13 


72 


NS3 


G 1095 A 
A 1347 T 


E 1202 G 
A 1347 T 






E1202G 


E1202G 


G 1095 A 


G 1095 A 


NS4A 


















NS4B 


















NS5A 


N2041T 
S2173F 


S2173F 


S2173F 


E2263 


K @ 2039 


K @ 2039 


L2198S 
R 2283 R 


L2198S 
R2283R 


NSSB 



















See Table 4 A legend. 

20 
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TABLE 5B 



Cell clone 


HBIV1 


HE 17 


HB68 


HB60 


isolate 


85 


93 


10 


14 


42 


1 


13 


7 


NS3 


















NS4A 


















NS4B 


















NS5A 


S 2197 F 


N2041T 

S2197F 
A 2199 T 


S2204 
R 


S2204 
R 


S2204R 


A 2199 T 


A 2199 T 


S2204R 


NS5B ; 


N2710N 


N2710N 















See Table 4A legend. 



10 



Example 4: Functional Characterization of Consensus Mutations 

The identification of consensus mutations in recloned replicons 
indicated that replication proficiency of replicon RNAs contained in selected cell 
clones depended from the presence of such mutations. To substantiate this 
hypothesis, the effect of several consensus mutations on replication were analyzed. 

Consensus mutations found in the NS5A region were more closely 
analyzed. Consensus mutations were segregated from the non-consensus ones, and 
pHCVNeo.17 derivatives containing single or multiple consensus mutations were 
constructed (Table 6). 



TABLE 6 



Construct 



Consensus mutations 



G418cfu/i0 5 
transfected 
cells 



P HCVNeol7.wt 
pHCVNeol7.GAA 
pHCVNeol7.mO 
pHCVNeoi7.ml 
pHCVNeol7.m2 
P HCVNeol7.m3 
pHCVNeol7.m4 



NS3 



NS5A 



S2204R 
N204IT 
S2173F 
S2197F 
L2I98S 



EMCVIRES 



0-3 
0 

30-130 
0-3 
15-60 
160-500 
30-50 
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TABLE 6 



Construct 


Consensus mutations 


G418cfu/10 > 
transfected 
cells 




NS3 


NS5A 


EMCVIRES 




pHCVNeol7.m5 




K@2039 




25-55 


pHCVNeo!7.m6 


E1202G; A1347T 


S2173F 


Extra A 


13-100 


P HCVNeol7.m7 




N2041T;S2173F 




0-1 


pHCVNeol7.m8 




N2041T;S2197F 




360-500 


pHCVNeol7.m9 




N2041T;L2198S 




140-170 


pHCVNeol7.ml0 


E1202G 


K<§>2039 




1060 


pHCVNeol7.mil 




S2197F;A2199T 




900 


pHCVNeol7.ml2 




N2041T; S2197F; A2199T 




>1000 


pHCVNeol7.ml3 




N2041T;S2197F 


Extra A 


100 


pHCVNeol7.ml4 




S2197F; A2199T 


Extra A 


>500 


pHCVNeol7.ml5 




A2199T 




300-600 



Huh-7 cells (2x10*) were transfected with 10 ug of RN A transcribed from the indicated constructs. 
Approximately 2x10 s cells were plated in a 10 cm tissue culture dish and cultured with 1 mg/ml G418 
for 20 days. 

Colonies surviving selection were stained with crystal violet and counted. 



RNAs transcribed in vitro from these constructs were transfected in 
Huh-7 cells and the affect on replication was estimated by counting neomycin 
resistant colonies (G418 cfu). As shown in Table 6, all but 1 construct containing 
single consensus mutations showed a significant increase on G418 cfu efficiency, thus 
10 indicating that the corresponding mutations improved replication. Noticeably, 2 
mutants containing single mutations in NS5A (m3 and m 15) were clearly more 
effective than all other single mutants. Results of mutants containing 2 or more 
mutations, indicated the presence of a synergistic effect in some combinations (m8, 
m9, mil and possibly mlO), but also a slightly antagonistic effect in 1 mutant (m7). 

15 

Example 5: Replicon Replication in the Absence of Selection 

Replication of HCV repl icons in the absence of a G418 selection was 
detected using quantitative PCR (TaqMan). At 24 hours post-transfection a large 
amount of replicon RNA was detected in cells transfected with all replicons, including 
20 the GAA control replicon containing mutations in the catalytic GDD motif of the 

NS5B polymerase. This result suggested that analysis at very early time points (up to 
48 hour post-transfection) essentially measured the input RNA. Northern blot 
analysis also indicated that after 24 hours the majority of the transfected RNA was 
degraded intracellular^ (data not shown). 
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Analysis at later time points showed that the amount of rcplicon RNA 
was considerably reduced at 4 days and eventually became undetectable (6/8 days) in 
cells transfected with replicon HCVNeol7.wt, but was still high in cells transfected 
with replicons mO, m3 and ml5 (Table 7). At day six, that the amount of replicon 
5 RNA became undetectable in cells transfected with replicon HCVNeol7.wt, mO, and 
m2, but was detectable in cells transfected with replicon m3 and ml5 (Table 7). 



TABLE 7 



Name 


HuH7 


RNA equ. 


RNA equ. 




day 4 


day 6 


Wt 


1 X 


1 X 


hcvneol7.m0 


3x 


lx 


hcvneol7.m2 


lx 


1 X 


hcvneol7.m3 


5x 


3x 


hcvneo!7.ml5 


6x 


5x 



10 

Persistence of mO, m3 and ml5 replicons RNA was abolished by 
treatment with interferon-a or with an HCV inhibitory compound (data not shown). 
Moreover, RNA persistence was not observed with mutated replicons carrying the 
NS5B GAA mutation besides adaptive mutations (data not shown). Taken together, 

15 these results demonstrated that quantitative PCR could be used to monitor replication 
at early times post-transfection, and can be used to evaluate the replication proficiency 
of replicon RNAs containing mutations. 

Comparison of the results shown in Tables 6 and 7, indicated that there 
was a good correlation between the amount of replicon RNA detected by TaqMan and 

20 the G418 cfu efficiency. Nonetheless, some mutants (m2, m3) showed a pronounced 
effect on G418 cfu efficiency, and little if any effect on early replication as measured 
by TaqMan PCR, while other mutants (mO) showed the reverse behavior. 
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Example 6: HCV Rcplicon Enhanced Cells 

HCV replicon enhanced cells were produced by introducing an HCV 
replicon into a host, then curing the host of the replicon. Adaptive mutations (or 
combinations of them) by themselves increased up to 2 orders of magnitude the G418 
5 cfu efficiency and enhanced early replication comparably. Nonetheless, even with the 
most effective mutants, only a small percentage of transfected cells (<5 %, data not 
shown) gave rise to G418 resistant clones containing functional replicons. This 
observation was attributed, at least in part to a low cloning efficiency of Huh-7 cells 
(data not shown), and only a fraction of Huh-7 cells being competent for replication. 
10 Several clones were cured of endogenous replicons by treating them 

for about 2 weeks with IFN-a or with a HCV inhibitory compound. Analysis at the 
end of the treatment showed that neither viral proteins nor replicon RNA could be 
detected. 

Cured cells (10IFN and Cl.60/cu) were transfected with mutated 
15 replicons and replication efficiency was determined by counting neomycin resistant 
clones (10EFN) or by TaqMan (10IFN and CI.60/cu). As shown in Table 8, for all 
tested replicons the G418 cfu efficiency in 10IFN cells was at least 5 fold higher than 
in parental Huh-7 cells. This increase in G4I8 cfu efficiency was particularly relevant 
for a subset of mutants (m3, m5, m8, m9, ml5). 

20 

TABLE 8 



Construct 


Consensus mutations 


G418cfu/10> 
transfected cells 




NS3 


NS5A 


EMC VIRES 




pHCVNeoH.wt 








12-56 


pHCVNeolZGAA 








0 


pHCVNeo!7.m0 




S2204R 




180-1000 


pHCVNcol7.mi 




N2041T 




8-13 


pHCVNcol7.m2 




S2173F 




2000 


pHCVNeol7.m3 




S2197F 




1600 - 3000 


pHCVNeol7.m4 




L2198S 




190-650 


pHCVNeol7.m5 




K@2039 




1600-3000 


pHCVNeo!7.m6 


E1202G; A1347T 


S2173F 


extra A 


600-2000 


P HCVNeol7.m7 




N2041T;S2173F 




170-800 


pHCVNcoI7.m8 




N2041T; S2197F 




>4000 


pHCVNeol7.m9 




N204IT;L2198S 




1400-3000 


pHCVNeo!7.ml0 


E1202G 


K@2039 




>4000 


pHCVNeoi7.mil 




S2197F; A2I99T 




>4000 



31 



WO 02/059321 



PCT/EP02/00526 



TABLE 8 



Construct 


Consensus mutations 


G418cfu/10 5 
transfected cells 


pHCVNepl7.ml2 
pHCVNeol7.ml3 
P HCVNeol7.ml4 
P HCVNeol7.mI5 


NS3 NS5A EMCV IRES 

N2041T; S2197F; A2199T 

N2041T; S2197F extra A 
S2197F; A2199T extra A 
A2199T 


>4000 
>4000 
>4000 
>4000 



10 



15 



Approximately 2x10 s cells were plated in a 10 cm tissue culture dish and cultured with 1 mg/ml G418 
for 20 days. 

Colonies surviving selection were stained with crystal violet and counted. 

Strikingly, the best mutants yielded a number of G4 18 resistant clones 
ranging between 20 and 80% of the cell clones which grew in the absence of G418 
(data not shown), thus indicating that the majority of 10IFN cells were competent for 
replication. This result was confirmed by TaqMan analysis (Table 9), in which the 
fold increase versus the parental Huh-7 cells was very high. The data indicates that 
repjicons carrying adaptive mutations replicate vigorously in replicon enhanced cells 
such as 10IFN and C1.60/cu. 

TABLE 9 



Name 


10E 


FN 


C1.6I 


O/cu. 


RNAequ. 


RNA equ. 


RNAequ. 


RNAequ. 


Wt 

hcvneol7.m0 
hcvneol7.m2 
hcvneol7.m3 
hcvneol7.ml5 


Day 4 
lx 

46 x 
2x 

68 x 

247 x 


day 6 
lx 

12 x 
2x 

49 x 

80x 


day 4 

1 X 

78 x 
lx 
19 x 

268 x ! 


Day 6 

lx 
512 x 

2x 
392 x 
5518 x 



Expression of viral proteins was determined in replicon enhanced cells 
20 using an ELISA assay designed to detect the NS3 protein in transfected cells plated in 
96 wells microliter plates (Cell-ELISA). As shown in Table 10, 24 hours post- 
transfection cells transfected with all tested replicons expressed low but detectable 
levels of the NS3 protein. 
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TABLE 10 





NS3 arbitrary units 


■ • 


24 h i 


>.t. 


96 r 


i p.t. 


Name 




+ IFN 




+IFN 


Construct 










Mock 


1 


1 


1 


1 


pHCVNeol7.wt 


3.7 


4.2 


1.2 


1.3 


pHCVNeol7.GAA 


3.1 


3.2 


1.1 


1 


pHCVNeol7.mO 


3.4 


3.2 


9.9 


0.8 


pHCVNeol7.m3 


5.7 


4.6 


4.7 


1.5 


pHCVNeol7.m8 


6.6 


5.1 


15.1 


1.4 


P HCVNeol7.mlO 


8 


5.6 


9.2 


1.8 


pHCVNeol7.mll 


8.4 


6.2 


13.6 


1.8 



10IFN cells (2x10*) were transfected with 1 0 ug of RNA transcribed from the indicated constructs. 
Cells were plated in 96 wells microliter plates as indicated in Example 1 . 



5 Where indicated (+IFN), I FN -a (100 U/ml) was added to the culture medium 4 hours post-transfection. 
At the indicated times post-transfection, cells were fixed and analyzed by Cell-ELISA. 

The early expression shown in Table 10 is likely due to translation of 

transfected RNA, since it was comparable in all replicons (including that carrying the 

GAA mutation) and was not affected by IFN-ol At 4 days post-transfection, NS3 

10 expression persisted or increased in cells transfected with replicons carrying 

consensus mutations, but could not be detected anymore in cells transfected with wt 
and GAA replicons. In addition, NS3 expression was almost completely abolished 
when cells were cultured in the presence of EFN-cl 

Taken together, these results indicated that the level of NS3 expression 

15 reflected the replication rate. Indeed, NS3 expression level (Table 10) paralleled the 
RNA level measured by TaqMan (Table 9). The high replication proficiency of 
10IFN cells was further confirmed by immunofluorescence experiments which 
showed that more than 50% of cells transfected with replicons m8 and ml 1 expressed 
high level of viral proteins, and that expression was almost completely abolished by 

20 DFN-ol 

Example 7: Replication of Full Length Constructs 

This example illustrates the ability of a full length HCV genome 
containing adaptive mutations described herein to replicate in a replicon enhanced 
25 host cell. The full length sequence of the HCV isolate Con-1 (EMBL-Genbank No. 
AJ238799) (plasmid pHCVRBFLwt) and 2 derivatives containing either the N204 IT 
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and S2173 F mutations (plasmid pHCVRBFL.m8) or the S2197F and A2199T 
mutations (plasmid pHCVRBFL.mll) were used as starting constructs. 

RNAs transcribed from the starting constructs were transfected in 
10IFN cells and their replication proficiency was assessed by Cell-ELISA, 
5 immunofluorescence and TaqMan. Both constructs containing consensus mutations 
(pHCVRBFL.m8 and pHCVRBFLml 1) replicated, while no sign of replication was 
observed with the wt. construct (data not shown). 

Example 8: Replicons with Reporter Gene 

10 This example illustrates an HCV replicon containing adaptive 

mutations and a reporter gene. A pHCVNeol7.wt derivative where the Neo coding 
region was substituted with that coding for human placenta] secretory alkaline 
phosphatase (pRBSEAPS.wt) and a derivative also containing the N2041T and 
S2173F mutations (plasmid pRBSEAP5.m8) were constructed. RNAs transcribed 

15 from these plasmids were transfected in 10IFN cells and their replication proficiency 
was assessed by measuring secretion of alkaline phosphatase. Analysis of the kinetics 
of secretion suggested that only plasmid pRBSEAP5.m8 was competent for 
replication (data not shown). 

20 Example 9: SEP. ED. Nos. 1 and 2 

SEQ. ED. NOs. 1 and 2 are provided as follows: 

SEP. ID. NO. 1 

MSTNPKPQRKTKRNTNRRPQDVKFPGGGQIVGGV^ 
25 SERSQPRGRRQPIPKARQPEGRAWAQPGYPWPLYGNEGLGWAGWLLSPRGS 

RPSWGPTDPRRRSRNIXjKVTOTLTCG 

RVUEDGVNYATGNII>GCSFSIFIX^ 

NASIVYEAADMIMHTPGCVPCVRENNSSRCW 

HVDIXVGAAAirSAMYVGD 
30 VTGHRMAWDMMMNWSPTA 

YYSMVGNWAKVLIVMUJAGVDGGTYX^ 

LVm^GSWHINRTAIJ^CNDSLOT^ 

AQGWGPITYNEiSHSSDQRPYCWHYAPRPCGIVPAAQVCGPVYCFTPSPVVVG 
TTDRFGVPTYSWGENETD\n^LU^NTRPPQGNWFGCTWMN 
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CNIGGIGNKTLTCPTDCFRKHPEATYTKCGSGPWLTPRCLVHYPYRLWHYPC 
TVNFITFKVRMWGGVEHRLEAACNW^ 

QVLPCSFTTLPAl^GUHlJIQ>rV\^VQYLYGIGSAWSFAIKWEYV^ 
ADARVCACLWMMIXIAQAEAALE^VVIJ^AASVAGAHGII^FLVFFCAAWY 
5 IKGRLVPGAAYALYGVWPIJLIJJXAIPPRAYAMDREMAASCGGAVWGL 
TLSPHYKmAPJJWWLQYFTreAEAHLQVW^ 
UFTITKIIlj\IlXJPm\ajQAGriXWYF^ 
ALMKI^ALTGTYVYDHLTPIJLDWAHAGIJU)I^VA^ 
GADTAACGDIILGIJVSARRGREIHLGPADSliGQGWIUJLAPITAYSO^TOGL 

10 LGCIITSLTGRDRNQVEGEVQVVSTATQSFIJVTCVNGVCWTVYHGAGSKTLA 
GPKGPITQMYTNVDQDLVGWQAPPGARSLTPCTCGSSDLYLVTRHADVIPVR 
RRGDSRGSIJL5PRPVSY1JCGSSGGPLLCPSGHAVGIFRAAVCTRGVAKAVDFV 
PVESMETTN^SPVFTDNSSPPAVPQTFQVAHLHAPTGSGKSTKVPAAYAAQG 
Y K VLVLNPS V A ATLGFG A YMSKAHGIDPN IRTG VRTITTG APITYSTYGKFLA 

1 5 DGGCSGGA YDmCDECHSTDSTTILGIGTVLDQAETAGARLVVLATATPPGS V 
TVPHP^^^EEVAI^STGFJPFYGKAIPIFTIKGGRHLIFCHSKKKCDFJLAAKLSGLG 
U^AVAYYRGLJ)VSVIPTSGDVrVVATDAmTGFTGDFDSVIDCNTCVTQTVD 
FSLDPTFTIEITrVPQDAVSRSQRRGRTGRGRMGIYRFAaTGERPSGMFDSSVL 
CECYDAGCAWYELTPAETSVRLRAYIJsrrPGI^VCQDHl^FWESVFTGLTHlD 

20 AHFI^QTKQAGDNFPYLVAYQATVCARAQAPPPSWDQMWKCLIRLKPTLHG 
PTPLLYRLGAVQNEVTTTHPITKYIMACMSADl^V\nrSTWVLVGGVUVALAA 
YCLTTGSVVIVGRIILSGKPAnPDREVLYREFDEMEECASHLPYIEQGMQLAEQ 
FXQKAIGLLQTATKQAEAAAPVVESKWRTLEAFWAKHMWNHSGIQYLAGLS 
TLreNPAIASLMAFTASITSPLTTQHTLlJMLGGWVAAQLAPPSAASAFVGAG 

25 IAGAAVGSIGLGKVLVDnAGYGAGVAGALVAFTCVMSGEMPSTEDLVNLLPA 
Il^PGALVVGVVCAAILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPE 
SDAAAR\rrQIL^SLTrrQLLKRLHQWINEDCSTPCSGSWERDVWDWICTVLTD 
FKTWLQSKIIPRLPGVPFFSCQRGYKGVWRGIXjIMQTTCPCGAQITGHVKNG 
SNnUVGPRTCSNTWHGTFPINAYTTGPCTPSPAPNYSRALWRVAAEEYVEVT 

30 RVGDFHY\TGMTrohAfKCPCQVPAPEFFTEVTX3VP^ 

TFLVGONfQYLVGSQIJ'CEPEPDVAVLTSMLTDPSHlTAETAKRRLARGSPPSL 
ASSSASQLSAPSLKATCTTRHDSPDADLffiANLLWRQEMGGNITRVESENKVV 
OJDSFEPUiAEEDEREVSVPAEIlJlRSRKFPRAMPnVARPDYNPPLLESWKDPD 
YVPPVVHGCPLPPAKAPPIPPPRRKRTVVLSESTVSSALAELATKTFGSSESSA 

35 VDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEE 
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ASEDVVCCSMSYTWTGALrrrcAA^ 
SUM^KXVTFDRLQX^D^ 
ARSKFGYGAKDVRNLSSKAVNHIRS\^ 
PEKGGRKPARLIVFPDLGWVCEKMALYDVVSTU^ 
5 VEFLVNAWKAKKCPMGFAYDTR<^ 

IRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSC^ 
AKUPCTMLVCGD^ 

YDLEUTSC^SNVSVAHDASGKRVYYLTRDPT^ 
GMIMYAPTLWARMIIJVlTH^ 
10 RmGI^AFSIJISYSPGEINRVASCUUttJGWPIJl 
AATCGKYUWVAWTKLKL^ 
RWFMWCLLLLSVGVGIYLLPNR 

SEP. ID. NO. 2: 
15 gccagcccccgattgggggcgacactccaccatagatca^ 

gccatggcgttagtatgagtglcgtgcagcctccaggaccccccctcccgggagagccatagt 

agtacaccggaattgccaggacgaccgggtcxtttcttggatc^ 

agactgctagccgagtagtgttgggtcgcgaaa^^ 

ctcgtagaccgtgcaccatgagcacgaatcctaaacctcaaagaaaaaccaaacgtaacaccaaccgccgcccacagga 
20 cgtcaagttcccgggcggtggtcagatcgtcggt^^ 

gactaggaagacttccgagcggtcgcaacctcglggaaggcgacaacctatccccaaggctcgccagcccgagggtagg 

gcctgggctcagaxgggtacccctggcccctctatggcaatgagggcttggggtgggcaggat^ 

ggctctcggcctagttggggcrccacggacccccgg^ 

ggcttcgccgatctcatggggtacattccgctcgtcggcgcccccctagggggcgctgccagggcc^ 
25 ccggg^tggaggacggcgtgaactatgcaacagggaatctgcccggttgctccttttctat^ . 
gtttgaccatcccagcttccgcttatg^ 

cattgtgtatgaggcagcggacatgatcatgcalacccccggglgcgtgccctgcgttcgggagaa^ 

tgggtagcgctcactcccacgctcgcggccagg 

gttg^cggctgctctctgctccgctatgtac# 

30 gcctcgccggcacgagacagtacaggactgcaattgctcaatatatcccggccacgtgacaggtcaccgtatggcttggga 
tatgatgatgaaetggtcacctacagcagccctagtggtat^ 
cgggggcccattggggagtcctagcgggccttgccta^ 
ctctttgccggcgttgacgggggaacctatgfgaca 

cacccgggtcatcccagaaaatccagcttgtaaacaccaacggcagctggcacatcaacaggactgccctgaactgcaat 
35 gactccctcaacactgggttccttgctgcgctgttctacgtgcacaagttcaactcatctggatgcccagagcgcatggccag 
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ctgcagccccatcgacgcgttcgctcaggg^ 

ttgttggcactacgcaccccggccgtgcggtatcgtacccgcggcgcaggt^ 

cctgtcgtggtggggacgarcgaccg^ 

aacaacacgcggccgccgcaaggcaactggtttggctgtacatggatgaatagcactgggttcaccaagacgtgcggg^ 

ccccccgtgtaacatcggggggatcggcaataaaaccttgacctgccccacggac^ 

cttacaccaagtgtggttcggggccttggttgacarc^ 

actgtcaactttaccatcttcaaggttaggatgtacgtggggggagtggagcacaggcte^ 

gaggagagcgttgtaacxtggaggacagggacagatcagagcttagcccgctgctgctgtctacaacgg 

ttgcxctgttccttcaccaccctaccggctctgtcx^ 

acggtatagggtcggcggttgtctcctttgcaatcaaatgggagtatgtcctgttgctcttccttctto^ 

ctgpgcctgcttgtggatgatgctgctgatagrt^ 

gtggccggggcgcatggcattctctccttccte^ 

ggcatatgccctctacggcgtatggccgctactcctgctcctgctggcgttaccac^ 

gatggcagcatcgtgcggaggcgcggttttcgtaggtctgatactcttgaccttgtcaccgcactataagct^ 

gctcatatggtggttacaatattttatcaccagggccgaggcacacttgcaagtgtggatccccccccto^ 

gccgcgatgccgtcatcctcctcacgtgcgcg^ 

ggtccactcatggtgctccaggctggtataacc 

ggtgcggaaggttgctgggggtcattatgtccaaatggctctcatgaagttggccgcactgacaggtacgta^ 

atctcaccccactgcgggactgggcccacgcgggcrt^ 

ggagaccaaggftatcacctggggggcagacaccgcggcgt^ 

gggggagggagatacatctgggaccggcagacagccttgaagggcaggggtggcgactcctcgcgcctattac^ 

ctcceaacagacgcgaggcctacttggctgcatcatcacto^^ 

gtccaagtggtctccaccgcaaeacaatctt^ 

ctcaaagacccttgccggcccaaagggcccaatc^ 

cgcccaxggggcgcgttccttgacaccatgc^ 

tccggtgcgccggcggggcgacagcagggggagccta^ 

ggtccactgctctgcccctcggggcacgctgt^ 

gactttgtacccgtcgagtctatggaaacca^ 

agacattccaggtggcccatctacacgcccctactggtagcggcaagagcactaaggtgccggctgcgtatgcagccc 

gggtataaggtgcttgtcctgaacccgtccgtcgccgccaccctaggtttcggggcgtatatgtctaaggca^ 

ccctaacatcagaaccggggtaaggaccatcaccacgggtgcccccatcacgtactccacctatggcaagtttc^ 

ggtggttgctctgggggcgcctatgacatcataatatgtgatgagtgccactcaactgactcgaccactatcctgggcatcgg 

cacagtcctggaccaagcggagacggctggagcgcgactcgtcgtgctcgccaccgctacgcctccgggatcgglcacc 

gtgccacatccaaacatcgaggaggtggctctgtccagcactggagaaatccccttttatggcaaagccatccccatcgaga 

ccatcaagggggggaggcacctcattttctgccattccaagaagaaatgtgatgagclcgccgcgaagctglccggcctcg 
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gactcaatgctgtagcatattaccgg^ 

gctctaatgacgggctttaccggcgamcgactcagtgatcgactgcaatacatgtgtcacccagaca^ 
ggacccgaccucaccattgagacgacgaccgtg^ 
aggggcaggatgggcamacaggmgtgactrcaggagaacggra 
5 gctatgacgcgggctgtgcttggtacgagctcacgcccgccgagacctcagttaggttgcgggcttacctaaacacaccag 
ggttgcccgtctgccaggaccatctggagttctgggagagcgtc^ 

cagactaagcaggcaggagacaacttcccctacctggtagcataccaggctacggtgtgcgccagggctcaggctccacc 

tccatcgtgggaccaaatgtggaagtgtctcatacggctaaagcctacgctgcacgggccaacgcccctgc^ 

ggagccgttcaaaacgaggttactaccacacaccccataaccaaatacatcatggcatgcatgtcggctgacctggaggtc 

10 gtcacgagcacctgggtgctggtaggcgga# 

gtgggcaggatcatcttgtccggaaagccggcxatcattcccgacagggaagtcctttaccgggagttcgatgagatgg^ 
gagtgcgcctcacacctcccttacatcgaacagggaatgcagctcgccgaacaattcaaacagaaggcaatcgggttgctg 
caaacagccaccaagcaagcggaggctgctgctcccgtggtggaatccaagtggcggaccctcgaagcctta 
agcatatgtggaatttcatcagcgggatacaatatttagcaggcttgtccactctgcctggcaaccccgcgatagcatcactga 

1 5 tggcattcacagcctctatcaccagcccgctcaccacccaacataccc tcctgtttaacatcctggggggatgggtggccgc 
ccaacttgctcctcccagcgctgcttctgctttcgtaggcgccggcatcgctggagcggctgttggcagcataggc^ 
aaggtgcttgtggatattttggcaggttatggagcaggggtggcaggcgcgctcgtggcctttaaggtca^ 
atgccctccaccgaggacctggttaacctactccctgctatcctctcccctggcgccctagtcgtcggggtcgtgtgcgcag 
gatactgcgtcggcacgtgggcccaggggagggggctgtgcagtggatgaaccggctgatagcgttcgcttcgcggggta 

20 accacgtctcccccacgcactatgtgcctgagagcgacgctgcagcacgtgtcactcagatcctctctagtcttaccatcact 
cagctgctgaagaggcttcaccagtggatcaacgaggactgrt^ 

gattggatatgcacggtgltgactgatttcaagacctggctccagtccaagctcctgccgcgattgccgggagtccccttcttc 
tcatgtcaacgtgggtacaagggagtctggcggggcgacggcatcatgcaaaccacctgcccatgtggagcacagatcac 
cggacatgtgaaaaacggttccatgaggatcgtggggcctaggacctgtagtaacacgtggcatggaacattccccattaac 

25 gcgtacaccacgggcccctgcacgccctccccggcgccaaattattctagggcgctgtggcgggtggc 

cgtggaggttacgcgggtgggggamccactacgtgacgggcatgaccactgacaacgtaaagtgcccgt^ 
ggcccccgaattcttcacagaagtggatggggtgcggttgcacaggtacgctccagcgtgcaaacccctcctacggga^ 
aggtcacattcctggtcgggctcaatcaatacctggttgggtcacagctcccatgcgagcccgaaccggacgtagcag 
cacttccatgctcaccgacccctcccacattacggcggagacggctaagcgtaggctggccaggggatctcccccctcctt 

30 ggccagctcatcagctagccagctgtctgcgccttccttgaaggcaacatgcactacccgtcatgactccccggacgctgac 
ctcatcgaggccaacctcctgtggcggcaggagatgggcgggaacatcacccgcgtggagtcagaaaataaggtagtaat 
tttggactctttcgagccgctccaagcggaggaggatgagagggaagtatccgttccggcggagalcctgcggaggtcca 
ggaaattccctcgagcgatgcccatatgggcacgcccggattacaaccctccactgttagagtcctggaaggacccggact 
acgtccctccagtggtacacgggtgtccattgccgcctgccaaggcccctccgataccacctccacggaggaagaggacg 

35 gttgtcctgtcagaatctaccgtgtcttctgccttggcggagctcgccacaaagaccttcggcagctccgaatcgtcggccgt 
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cgacagcggcacggcaacggcctrt^ 

tccatgcccccccttgagggggagcxgggggatcccgatctcagcgacgggtcttggto 
tgaggac^cgtctgctgctcgatgtcctac^ 

gcccatcaatgcactgagcaactctttgctc^glcaccacaacttggtctatgctacaacatctcgcagcgcaagcctgcggc 
5 agaagaaggtcacctttgacagactgcaggtcctggacgarc^^ 

gtccacagttaaggctaaacttctatccgtggaggaagcctgtaagctgacgcccccacattcggccagatct^ 
atggggcaaaggacgtccggaacctatccagcaaggccgttaaccacatccgctccgtgtggaaggacttgctggaagac 
actgagacaccaattgacaccaccatcatggcaaaaaatgaggttttctgcgtccaaccagagaaggggggccgcaagcc 
agctcgccttatcgtattcccagamgggggltcgtgtgtgcgagaaaatggax:ttiacga 

10 gccgtgatgggctcttcatacggattccaatactctcctggacagcgggtcgagttw 
aatgccctatgggcttcgcatatgacaccxgctgtmgactcaacggtcactgagaat^ 
accaatgttgtgacttggcccccgaagccagacaggccataaggtcgctcacagagcggctttacatcgggggc 
ctaattctaaagggcagaactgcggctatcgccggtgccgcgcgagcggtglactgacgaccagctgcggtaato 
catgjtacttgaaggccgctgcggcctgtcgagctgcgaagctccaggactgcacgatgctcgtatgc^ 

15 cgttatctgtgaaagcgcggggacccaagaggacgaggcgag^^ 

ccccccctggggacccgcccaaaccagaatacgacttggagttgataacatcatgctcctccaatgtglcagtcgcgcacg 
atgcatctggcaaaagggtgtactatctcacccgtgaccccaccaccccccttgcgcgggctgcgtgggagacagctagac 
acactccagtcaattcctggctaggcaacatcatcatgtatgcgcccaccttgtgggcaaggatgatcctgatgactcatttctt 
ctccatccttctagctcaggaacaacttgaaaaagaxtagattgtcagatctacggggcctgttactccattgagccacttga 

20 cctacctcagatcattcaacgactccatggccttagcgcattttcactccatagttactctccaggtgagatcaatagggiggct 
tcatgcctcaggaaacttggggtaccgcccttgcgagtctgg^ 
cagggggggagggctgccacttgtggcaagtacctcttcaactgggcagtaaggac^ 
gctgcglcccagttggattlatccagctggttcgttgctggttacagcgggggagacatatatcacagcctgtctc 
ctxcgctggttcatgtggtgcctactcctactttctgtaggggtaggcatctatctactccccaaccga 

25 acactccaggccaataggccatcctgtmmccctttmttmcmmmtttttm 
ttccttttctttcctttggtggctccatcta^ 
gctgatactggcctctctgcagatcaagt 

Other embodiments are within the following claims. While several 
30 embodiments have been shown and described, various modifications may be made 
without departing from the spirit and scope of the present invention. 
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WHAT IS CLAIMED IS: 

1. A nucleic acid molecule comprising a region selected from the 
group consisting of: 

5 a) an altered HCV NS3 encoding region coding for one or more 

NS3 mutations, wherein at least one of said NS3 mutations, identified by reference to 
the amino acid sequence numbering of SEQ. ID, NO. 1, is selected from the group 
consisting of: 

amino acid 1095 being Ala, 
10 amino acid 1202 being Gly, and 
amino acid 1347 being Thr, 

b) an altered HCV NS5 A encoding region coding for one or more 
NS5A mutations, wherein at least one of said NS5A mutations, identified by reference 
to the amino acid sequence numbering of SEQ. ID. NO. 1, is selected from the group 

15 consisting of: 

amino acid 2041 being Thr, 

a Lys insertion between residue 2039 and 2040. 

amino acid 2173 being Phe, 

amino acid 2197 being Phe, 
20 amino acid 2198 being Ser, 

amino acid 2199 being Thr, and 

amino acid 2204 being Arg; and 

c) an altered encephalomyocarditis virus (EMCV) internal 
ribosome entry site (IRES) region containing one or more EMCV IRES mutations, 

25 wherein at least one of said EMCV IRES mutations, identified by reference to the 
nucleotide number of SEQ. ID. NO. 3, is an insertion at nucleotide 1736 of adenine. 

2. The nucleic acid molecule of claim 1, wherein said nucleic acid 
molecule comprises said NS5A encoding region. 

30 

3. The nucleic acid molecule of claim 2, wherein at least two of 
said NS5A adaptive mutations are present. 
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4. Hie nucleic acid molecule of claim 2, further comprising a 
region encoding for a HCV NS3 region, wherein said NS3 region may be the same or 
different than said altered NS3 region. 

5 5. The nucleic acid molecule of claim 4, wherein said nucleic acid 

molecule is an HCV replicon comprising a HCV 5' UTR-PC region, said NS3 
encoding region, an HCV NS4A encoding region, an HCV NS4B encoding region, 
said NS5A encoding region, an HCV NS5B encoding region, and a HCV 3' UTR. 

10 6. The nucleic acid molecule of claim 5, wherein said HCV 

replicon further comprises a sequence encoding for a reporter protein. 

7. The nucleic acid molecule of claim 5, wherein said HCV 
replicon further comprises a sequence encoding for a selection protein. 

15 

8. The nucleic acid molecule of claim 5, wherein said HCV 
replicon further comprises a HCV core encoding region, a HCV El encoding region, a 
HCV E2 encoding region, a HCV p7 encoding region, and a HCV NS2 encoding 
region. 

20 

9. A nucleic acid molecule comprising a region selected from the 
group consisting of: 

a) an altered HCV NS3 encoding region containing one or more 
NS3 mutations, wherein at least one of said NS3 mutations, identified by reference to 

25 the nucleotide numbering of SEQ. ID. NO. 2, is selected from the group consisting of: 
nucleotide 3625 being cytosine, 
nucleotide 3946 being guanine, 
nucleotide 4380 being adenine, 

b) an altered HCV NS5A encoding region containing one or more 
30 NS5A mutations, wherein at least one of said NS5A mutations, identified by reference 

to the nucleotide numbering of SEQ. ID. NO. 2, is selected from the group consisting 
of: 

an insertion of 3 adenine residues between nucleotide 6458 and 6459, 
nucleotide 6463 being cytosine, 
35 nucleotide 6859 being thymine or uracil, 
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nucleotide 693 1 being thymine or uracil, 
nucleotide 6934 being cytosine, 
nucleotide 6936 being adenine, and 
nucleotide 6953 being adenine or guanine; and 
5 c) an altered encephalomyocarditis virus (EMCV) internal 

ribosome entry site (IRES) region containing one or more EMCV IRES mutations, 
wherein at least one of said EMCV IRES mutations, identified by reference to the 
nucleotide number of SEQ. ID. NO. 3, is an insertion at nucleotide 1736 of adenine. 

10 10. The nucleic acid molecule of claim 9, wherein said molecule 

comprises said altered NS5A encoding region, and the nucleotide sequence of said 
altered NS5A region is provided for by bases 6258-7598 of SEQ. ID. NO. 2, or the 
RNA version thereof, modified with one or more of said NS5A modifications selected 
from the group consisting of: 

15 an insertion of 3 adenine residues between nucleotide 6458 and 6459, 
nucleotide 6463 being cytosine, 
nucleotide 6859 being thymine or uracil, 
nucleotide 6931 being thymine or uracil, 
nucleotide 6934 being cytosine, 

20 nucleotide 6936 being adenine, and 

nucleotide 6953 being adenine or guanine. 

11. The nucleic acid molecule of claim 10, wherein said molecule 
is an HCV replicon comprising a HCV 5' UTR-PC region, a modified HCV NS3- 

25 NS5B region, and a HCV 3' UTR, wherein said modified NS3-NS5B region 
comprises said altered NS5A region. 

12. The nucleic acid molecule of claim 1 1, wherein said 5' UTR- 
PC region is the RNA version of bases 1-377 of SEQ. ID. NO. 2 and said 3' UTR is 

30 the RNA version of bases 9374-9605 of SEQ. ID. NO. 2. 

13. The nucleic acid molecule of claim 10, wherein said molecule 
is an HCV replicon comprising a HCV 5* UTR-PC region, a modified HCV NS3- 
NS5B region, and a HCV 3' UTR, wherein 

35 said 5' UTR-PC region is the RNA version of bases 1-377 of SEQ. ID. NO. 2; 
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said 3' UTR is the RNA version of bases 9374-9605 of SEQ. ID. NO. 2; and 
said modified NS3-NS5B region consists of the RNA version of bases 3420-9371 of 
SEQ. ID. NO. 2 modified with one or more modifications selected from the group 
consisting of: 
5 nucleotide 4380 being adenine, 
nucleotide 3625 being cytosine, 
nucleotide 3946 being guanine, 

an insertion of 3 adenine residues between nucleotide 6458 and nucleotide 6459, 
nucleotide 6463 being cytosine, 
10 nucleotide 6859 being uracil, 
nucleotide 6931 being uracil, 
nucleotide 6934 being cytosine, 
nucleotide 6936 being adenine, and 
nucleotide 6953 being adenine or guanine. 

15 

14. The nucleic acid molecule of claim 13, wherein said replicon is 
a genomic replicon that further comprises the RNA version of nucleotides 378-3419 
of SEQ. ID. NO. 2. 

20 15. A nucleic acid molecule comprising the nucleic acid base 

sequence of bases 1-7989 of SEQ. ID. NO. 3, or the RNA version thereof, consisting 
of one or more different modifications selected from the group consisting of: 

a) nucleotides 5335-5337 modified to code for arginine; 

b) nucleotides 5242-5244 modified to code for phenylalanine; 
25 c) nucleotides 5314-5316 modified to code for phenylalanine; 

d) nucleotides 5317-5319 modified to code for serine; 

e) nucleotides coding for lysine inserted after nucleotide 4843; 

0 nucleotides 2329-2331 modified to code for glycine, nucleotides 2764-2766 
modified to code for threonine, nucleotides 5242-5244 modified to code for 
30 phenylalanine, and an extra adenosine inserted after nucleotide 1736; 

g) nucleotides 4846-4848 modified to code for threonine, and nucleotides 5242-5244 
modified to modified to code for phenylalanine; 

h) nucleotides 4846^848 modified to code for threonine, and nucleotides 5314-5316 
modified to code for phenylalanine; 
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i) nucleotides 4846-4848 modified to code for threonine, and nucleotides 5317-5319 
modified to code for serine; 

j) nucleotides 2329-2331 modified to code for glycine, and nucleotides coding for 
lysine inserted after nucleotides 4843; 
5 k) nucleotides 5314-5316 modified to code for phenylalanine and nucleotides 5320- 
5322 modified to code for threonine; 

1) nucleotides 4846-4848 modified to code for threonine, nucleotides 5314-53 16 
modified to code for phenylalanine, and nucleotides 5320-5322 modified to code for 
threonine; 

10 m) nucleotides 4846-4848 modified to code for threonine, nucleotides 5314-5316 
modified to code for phenylalanine, and an extra adenosine inserted after nucleotide 
1736; and 

n) nucleotides 5314-5316 modified to code for phenylalanine, nucleotides 5320-5322 
modified to code for threonine, and an extra adenosine inserted after nucleotide 1736; 
15 and 

0) nucleotides 5320-5322 modified to code for threonine. 

16. The nucleic acid of claim 15, wherein said one or more 
different modifications is selected from the group consisting of: 
20 a) C5337A; 

b) C5243TorU; 

c) C5315TorU; 

d) TorU5318C; 

e) AAA inserted after 4843; 

25 0 A2330G, G2764A, C5243T or U, and adenosine inserted 1736; 

g) A4847C and C5243T or U; 

h) A4847C and C53 15T or U; 

1) A4847CandTorU5318C; 

j) A2330G and AAA inserted after 4843; 
30 k) C5315TorUandG5320A; 

I) A4847C,C5315TorU,andG5320A; 
m) A4847C, C5315T or U, and adenosine inserted 1736; 
n) C5315T or U, G5320A and adenosine inserted 1736; and 
o) G5320A. 
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17. The nucleic acid of claim 16, wherein said nucleic acid is RNA 
and comprises said nucleic acid base sequence. 

5 18. The nucleic acid of claim 17, wherein said nucleic acid is RNA 

and consists of said nucleic acid base sequence. 

19. An expression vector comprising a nucleotide sequence coding 
for the nucleic acid molecule of any one of claims 1-18, wherein said nucleotide 

10 sequence is transcriptionally coupled to an exogenous promoter. 

20. A recombinant cell human hepatoma cell, wherein said cell 
comprises the nucleic acid of any one of claims 5-8 and 1 1-18. 

15 21. The recombinant cell of claim 20, wherein said hepatoma cell 

is an Huh-7 cell. 

22. The recombinant cell of claim 20, wherein said cell is derived 
from a Huh-7 cell. 

20 

23. A recombinant cell made by a process comprising the step of 
introducing into a human hepatoma cell the nucleic acid of any one of claims 5-8 and 
11-18. 



25 24. A method of making an HCV replicon enhanced cell 

comprising the steps of: 

a) introducing and maintaining a HCV replicon in a cell; and 

b) curing said cell of said HCV replicon to produce said replicon 

enhanced cell. 

30 

25. The method of claim 24, wherein said cell is a human 

hepatoma cell. 

26. The method of claim 24, wherein said cell is a Huh-7 cell or is 
35 derived from a Huh-7 cell. 
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27. The method of claim 26, further comprising the step of 
confirming the ability of said replicon enhanced cell to maintain an HCV replicon. 

5 28 A method of making an HCV replicon enhanced cell containing 

a functional HCV replicon comprising the steps of: 

a) introducing and maintaining a first HCV replicon in a cell; 

b) curing said cell of said first replicon to produce a cured cell; 

and 

10 c) introducing and maintaining a second HCV replicon into said 

cured cell, wherein said second HCV replicon may be the same or different than said 
first HCV replicon. 

29 The method of claim 28, wherein said cell is a human 

15 hepatoma cell. 

30. The method of claim 29, wherein said human hepatoma cell is 

a Huh-7 cell. 

20 31. The method of claim 30, wherein said human hepatoma cell is 

derived from a Huh-7 cell. 

32. An HCV replicon enhanced cell made by the method of any 
one of claims 24-27. 

25 

33. An HCV replicon enhanced cell containing a HCV replicon 
made by the method of any one of claims 28-31. 

34. A method of measuring the ability of a compound to affect 
30 HCV activity comprising the steps of: 

a) providing said compound to the HCV replicon enhanced cell of 

claim 33; and 

b) measuring the ability of said compound to effect one or more 
replicon activities as a measure of the effect on HCV activity. . 

35 
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35. The method of claim 34, wherein said compound is a ribozyme. 

36. The method of claim 34, wherein said compound in an 
antisense nucleic acid. 

37. The method of claim 34, wherein compound is an organic 

compound 

38. The method of claim 34, wherein said step (b) measures HCV 
protein production. 

39. The method of claim 33, wherein said step (b) measures 
production of RNA transcripts. 
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1 GCCAGCCCCC GATTGGGGGC GACACTCCAC CATAGATCAC TCCCCTGTGA 

51 GGAACTACTG TCTTCACGCA GAAAGCGTCT AGCCATGGCG TTAGTATGAG 

101 TGTCGTGCAG CCTCCAGGAC CCCCCCTCCC GGGAGAGCCA TAGTGGTCTG 

151 CGGAACCGGT GAGTACACCG GAATTGCCAG GACGACCGGG TCCTTTCTTG 

201 GATCAACCCG CTCAATGCCT GGAGATTTGG GCGTGCCCCC GCGAGACTGC 

251 TAGCCGAGTA GTGTTGGGTC GCGAAAGGCC TTGTGGTACT GCCTGATAGG 

301 GTGCTTGCGA GTGCCCCGGG AGGTCTCGTA GACCGTGCAC CATGAGCACG 

351 AATCCTAAAC CTCAAAGAAA AACCAAAGGG CGCGCCATGA TTGAACAAGA 

401 TGGATTGCAC GCAGGTTCTC CGGCCGCTTG GGTGGAGAGG CTATTCGGCT 

451 ATGACTGGGC ACAACAGACA ATCGGCTGCT CTGATGCCGC CGTGTTCCGG 

501 CTGTCAGCGC AGGGGCGCCC GGTTCTTTTT GTCAAGACCG ACCTGTCCGG 

551 TGCCCTGAAT GAACTGCAGG ACGAGGCAGC GCGGCTATCG TGGCTGGCCA 

601 CGACGGGCGT TCCTTGCGCA GCTGTGCTCG ACGTTGTCAC TGAAGCGGGA 

651 AGGGACTGGC TGCTATTGGG CGAAGTGCCG GGGCAGGATC TCCTGTCATC 

701 TCACCTTGCT CCTGCCGAGA AAGTATCCAT CATGGCTGAT GCAATGCGGC 

751 GGCTGCATAC GCTTGATCCG GCTACCTGCC CATTCGACCA CC AAGC G AAA 

801 CATCGCATCG AGCGAGCACG TACTCGGATG GAAGCCGGTC TTGTCGATCA 

851 GGATGATCTG GACGAAGAGC ATCAGGGGCT CGCGCCAGCC GAACTGTTCG 

901 CCAGGCTCAA GGCGCGCATG CCCGACGGCG AGGATCTCGT CGTGACCCAT 

951 GGCGATGCCT GCTTGCCGAA TATCATGGTG GAAAATGGCC GCTTTTCTGG 

1001 ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT CAGGACATAG 

1051 CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC 

1101 CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC 

1151 CTTCTATCGC CTTCTTGACG AGTTCTTCTG AGTTTAAACA GACCACAACG 

1201 GTTTCCCTCT AGCGGGATCA ATTCCGCCCC TCTCCCTCCC CCCCCCCTAA 

1251 CGTTACTGGC CGAAGCCGCT TGGAATAAGG CCGGTGTGCG TTTGTCTATA 

1301 TGTTATTTTC CACCATATTG CCGTCTTTTG GCAATGTGAG GGCCCGGAAA 

1351 CCTGGCCCTG TCTTCTTGAC GAGCATTCCT AGGGGTCTTT CCCCTCTCGC 

1401 CAAAGGAATG CAAGGTCTGT TGAATGTCGT GAAGGAAGCA GTTCCTCTGG 

1451 AAGCTTCTTG AAGACAAACA ACGTCTGTAG CGACCCTTTG CAGGCAGCGG 

1501 AACCCCCCAC CTGGCGACAG GTGCCTCTGC GGCCAAAAGC CACGTGTATA 
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1551 AGATACACCT GCAAAGGCGG CACAACCCCA GTGCCACGTT GTGAGTTGGA 

1601 TAGTTGTGGA AAGAGTCAAA TGGCTCTCCT CAAGCGTATT CAACAAGGGG 

1651 CTGAAGGATG CCCAGAAGGT ACCCCATTGT ATGGGATCTG ATCTGGGGCC 

1701 TCGGTGCACA TGCTTTACAT GTGTTTAGTC GAGGTTAAAA AACGTCTAGG 

1751 CCCCCCGAAC CACGGGGACG TGGTTTTCCT TTGAAAAACA CGATAATACC 

1801 ATGGCGCCTA TTACGGCCTA CTCCCAACAG ACGCGAGGCC TACTTGGCTG 

1851 CATCATCACT AGCCTCACAG GCCGGGACAG GAACCAGGTC GAGGGGGAGG 

.1901 TCCAAGTGGT CTCCACCGCA ACACAATCTT TCCTGGCGAC CTGCGTCAAT 

1951 GGCGTGTGTT GGACTGTCTA TCATGGTGCC GGCTCAAAGA CCCTTGCCGG 

2001 CCCAAAGGGC CCAATCACCC AAATGTACAC CAATGTGGAC CAGGACCTCG 

2051 TCGGCTGGCA AGCGCCCCCC GGGGCGCGTT CCTTGACACC ATGCACCTGC 

2101 GGCAGCTCGG ACCTTTACTT GGTCACGAGG CATGCCGATG TCATTCCGGT 

2151 GCGCCGGCGG GGCGACAGCA GGGGGAGCCT ACTCTCCCCC AGGCCCGTCT 

2201 CCTACTTGAA GGGCTCTTCG GGCGGTCCAC TGCTCTGCCC CTCGGGGCAC 

2251 GCTGTGGGCA TCTTTCGGGC TGCCGTGTGC ACCCGAGGGG TTGCGAAGGC 

2301 GGTGGACTTT GTACCCGTCG AGTCTATGGA AACCACTATG CGGTCCCCGG 

2351 TCTTCACGGA CAACTCGTCC CCTCCGGCCG TACCGCAGAC ATTCCAGGTG 

2401 GCCCATCTAC ACGCCCCTAC TGGTAGCGGC AAGAGCACTA AGGTGCCGGC 

2451 TGCGTATGCA GCCCAAGGGT ATAAGGTGCT TGTCCTGAAC CCGTCCGTCG 

2501 CCGCCACCCT AGGTTTCGGG GCGTATATGT CTAAGGCACA TGGTATCGAC 

2551 CCTAACATCA GAACCGGGGT AAGGACCATC ACCACGGGTG CCCCCATCAC 

2601 GTACTCCACC TATGGCAAGT TTCTTGCCGA CGGTGGTTGC TCTGGGGGCG 

2651 CCTATGACAT CATAATATGT GATGAGTGCC ACTCAACTGA CTCGACCACT 

2701 ATCCTGGGCA TCGGCACAGT CCTGGACCAA GCGGAGACGG CTGGAGCGCG 

2751 ACTCGTCGTG CTCGCCACCG CTACGCCTCC GGGATCGGTC ACCGTGCCAC 

2801 ATCCAAACAT CGAGGAGGTG GCTCTGTCCA GCACTGGAGA AATCCCCTTT 

2851 TATGGCAAAG CCATCCCCAT CGAGACCATC AAGGGGGGGA GGCACCTCAT 

2901 TTTCTGCCAT TCCAAGAAGA AATGTGATGA GCTCGCCGCG AAGCTGTCCG 

2951 GCCTCGGACT CAATGCTGTA GCATATTACC GGGGCCTTGA TGTATCCGTC 

3001 ATACCAACTA GCGGAGACGT CATTGTCGTA GCAACGGACG CTCTAATGAC 

3051 GGGCTTTACC GGCGATTTCG ACTCAGTGAT CGACTGCAAT ACATGTGTCA 
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3101 CCCAGACAGT CGACTTCAGC CTGGACCCGA CCTTCACCAT TGAGACGACG 

3151 ACCGTGCCAC AAGACGCGGT GTCACGCTCG CAGCGGCGAG GCAGGACTGG 

3201 TAGGGGCAGG ATGGGCATTT ACAGGTTTGT GACTCCAGGA GAACGGCCCT 

3251 CGGGCATGTT CGATTCCTCG GTTCTGTGCG AGTGCTATGA CGCGGGCTGT 

3301 GCTTGGTACG AGCTCACGCC CGCCGAGACC TCAGTTAGGT TGCGGGCTTA 

3351 CCTAAACACA CCAGGGTTGC CCGTCTGCCA GGACCATCTG GAGTTCTGGG 

3401 AGAGCGTCTT TACAGGCCTC ACCCACATAG ACGCCCATTT CTTGTCCCAG 

3451 ACTAAGCAGG CAGGAGACAA CTTCCCCTAC CTGGTAGCAT ACCAGGCTAC 

3501 GGTGTGCGCC AGGGCTCAGG CTCCACCTCC ATCGTGGGAC CAAATGTGGA 

3551 AGTGTCTCAT ACGGCTAAAG CCTACGCTGC ACGGGCCAAC GCCCCTGCTG 

3601 TATAGGCTGG GAGCCGTTCA AAACGAGGTT ACTACCACAC ACCCCATAAC 

3651 CAAATACATC ATGGCATGCA TGTCGGCTGA CCTGGAGGTC GTCACGAGCA 

3701 CCTGGGTGCT GGTAGGCGGA GTCCTAGCAG CTCTGGCCGC GTATTGCCTG 

3751 ACAACAGGCA GCGTGGTCAT TGTGGGCAGG ATCATCTTGT CCGGAAAGCC 

3801 GGCCATCATT CCCGACAGGG AAGTCCTTTA CCGGGAGTTC GATGAGATGG 

3851 AAGAGTGCGC CTCACACCTC CCTTACATCG AACAGGGAAT GCAGCTCGCC 

3901 GAACAATTCA AACAGAAGGC AATCGGGTTG CTGCAAACAG CCACCAAGCA 

3951 AGCGGAGGCT GCTGCTCCCG TGGTGGAATC CAAGTGGCGG ACCCTCGAAG 

4001 CCTTCTGGGC GAAGCATATG TGGAATTTCA TCAGCGGGAT ACAATATTTA 

4051 GCAGGCTTGT CCACTCTGCC TGGCAACCCC GCGATAGCAT CACTGATGGC 

4101 ATTCACAGCC TCTATCACCA GCCCGCTCAC CACCCAACAT ACCCTCCTGT 

4151 TTAACATCCT GGGGGGATGG GTGGCCGCCC AACTTGCTCC TCCCAGCGCT 

4201 GCTTCTGCTT TCGTAGGCGC CGGCATCGCT GGAGCGGCTG TTGGCAGCAT 

4251 AGGCCTTGGG AAGGTGCTTG TGGATATTTT GGCAGGTTAT GGAGCAGGGG 

4301 TGGCAGGCGC GCTCGTGGCC TTTAAGGTCA TGAGCGGCGA GATGCCCTCC 

4351 ACCGAGGACC TGGTTAACCT ACTCCCTGCT ATCCTCTCCC CTGGCGCCCT 

4401 AGTCGTCGGG GTCGTGTGCG CAGCGATACT GCGTCGGCAC GTGGGCCCAG 

4451 GGGAGGGGGC TGTGCAGTGG ATGAACCGGC TGATAGCGTT CGCTTCGCGG 

4501 GGTAACCACG TCTCCCCCAC GCACTATGTG CCTGAGAGCG ACGCTGCAGC 

4551 ACGTGTCACT CAGATCCTCT CTAGTCTTAC CATCACTCAG CTGCTGAAGA 

4601 GGCTTCACCA GTGGATCAAC GAGGACTGCT CCACGCCATG CTCCGGCTCG 
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4651 


TGGCTAAGAG ATGTTTGGGA TTGGATATGC ACGGTGTTGA CTGATTTCAA 


4701 


GACCTGGCTC CAGTCCAAGC TCCTGCCGCG ATTGCCGGGA GTCCCCTTCT 


4751 


TCTCATGTCA ACGTGGGTAC AAGGGAGTCT GGCGGGGCGA CGGCATCATG 


4801 


CAAACCACCT GCCTATGTGG AGCACAGATC ACCGGACATG 




4851 


TTCCATGAGG ATCGTGGGGC CTAGGACCTG TAGTAACACG 


TGGCATGGAA 


4901 


CATTCCCCAT TAACGCGTAC ACCACGGGCC CCTGCACGCC 


CTCCCCGGCG 


4951 


CCAAATTATT CTAGGGCGCT GTGGCGGGTG GCTGCTGAGG 


AGTACGTGGA 


5001 


GGTTACGCGG GTGGGGGATT TCCACTACGT GACGGGCATG 


ACCACTGACA 


5051 


ACGTAAAGTG CCCGTGTCAG GTTCCGGCCC CCGAATTCTT 


CACAGAAGTG 


5101 


GATGGGGTGC GGTTGCACAG GTACGCTCCA GCGTGCAAAC 


CCCTCCTACG 


5151 


GGAGGAGGTC ACATTCCTGG TCGGGCTCAA TCAATACCTG GTTGGGTCAC 


5201 


AGCTCCCATG CGAGCCCGAA CCGGACGTAG CAGTGCTCAC TTCCATGCTC 


5251 


ACCGACCCCT CQCACATTAC GGCGGAGACG GCTAAGCGTA. GGCTGGCCAG 


5301 


GGGATCTCCC CCCTCCTTGG CCAGCTCATC AGCTAGCCAG CTGTCTGCGC 


5351 


CTTCCTTGAA GGCAACATGC ACTACCCGTC ATGACTCCCC 


GGACGCTGAC 


5401 


CTCATCGAGG CCAACCTCCT GTGGCGGCAG GAGATGGGCG 


GGAACATCAC 


5451 


CCGCGTGGAG TCAGAAAATA AGGTAGTAAT TTTGGACTCT 


TTCGAGCCGC 


5501 


TCCAAGCGGA GGAGGATGAG AGGGAAGTAT CCGTTCCGGC 


GGAGATCCTG 


5551 


CGGAGGTCCA GGAAATTCCC TCGAGCGATG CCCATATGGG 


CACGCCCGGA 


5601 


TTACAACCCT CCACTGTTAG AGTCCTGGAA GGACCCGGAC 


TACGTCCCTC 


5651 


CAGTGGTACA CGGGTGTCCA TTGCCGCCTG CCAAGGCCCC 


TCCGATACCA 


5701 


CCTCCACGGA GGAAGAGGAC GGTTGTCCTG TCAGAATCTA 


CCGTGTCTTC 


5751 


TGCCTTGGCG GAGCTCGCCA CAAAGACCTT CGGCAGCTCC 


GAATCGTCGG 


5801 


CCGTCGACAG CGGCACGGCA ACGGCCTCTC CTGACCAGCC 


CTCCGACGAC 


5851 


GGCGACGCGG GATCCGACGT TGAGTCGTAC TCCTCCATGC 


CCCCCCTTGA 


5901 


GGGGGAGCCG GGGGATCCCG ATCTCAGCGA CGGGTCTTGG 


TCTACCGTAA 


5951 


GCGAGGAGGC TAGTGAGGAC GTCGTCTGCT GCTCGATGTC 


CTACACATGG 


6001 


ACAGGCGCCC TGATCACGCC ATGCGCTGCG GAGGAAACCA AGCTGCCCAT 


6051 


CAATGCACTG AGCAACTCTT TGCTCCGTCA CCACAACTTG 


GTCTATGCTA 


6101 


CAACATCTCG CAGCGCAAGC CTGCGGCAGA AGAAGGTCAC 


CTTTGACAGA 


6151 


CTGCAGGTCC TGGACGACCA CTACCGGGAC GTGCTCAAGG 


AGATGAAGGC 
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6201 GAAGGCGTCC ACAGTTAAGG CTAAACTTCT ATCCGTGGAG GAAGCCTGTA 

6251 AGCTGACGCC CCCACATTCG GCCAGATCTA AATTTGGCTA TGGGGCAAAG 

6301 GACGTCCGGA ACCTATCCAG CAAGGCCGTT AACCACATCC GCTCCGTGTG 

6351 GAAGGACTTG CTGGAAGACA CTGAGACACC AATTGACACC ACCATCATGG 

6401 CAAAAAATGA GGTTTTCTGC GTCCAACCAG AGAAGGGGGG CCGCAAGCCA 

6451 GCTCGCCTTA TCGTATTCCC AGATTTGGGG GTTCGTGTGT GCGAGAAAAT 

6501 GGCCCTTTAC GATGTGGTCT CCACCCTCCC TCAGGCCGTG ATGGGCTCTT 

6551 CATACGGATT CCAATACTCT CCTGGACAGC GGGTCGAGTT CCTGGTGAAT 

6601 GCCTGGAAAG CGAAGAAATG CCCTATGGGC TTCGCATATG ACACCCGCTG 

6651 TTTTGACTCA ACGGTCACTG AGAATGACAT CCGTGTTGAG GAGTCAATCT 

6701 ACCAATGTTG TGACTTGGCC CCCGAAGCCA GACAGGCCAT AAGGTCGCTC 

6751 ACAGAGCGGC TTTACATCGG GGGCCCCCTG ACTAATTCTA AAGGGCAGAA 

6801 CTGCGGCTAT CGCCGGTGCC GCGCGAGCGG TGTACTGACG ACCAGCTGCG 

6851 GTAATACCCT CACATGTTAC TTGAAGGCCG CTGCGGCCTG TCGAGCTGCG 

6901 AAGCTCCAGG ACTGCACGAT GCTCGTATGC GGAGACGACC TTGTCGTTAT 

6951 CTGTGAAAGC GCGGGGACCC AAGAGGACGA GGCGAGCCTA CGGGCCTTCA 

7001 CGGAGGCTAT GACTAGATAC TCTGCCCCCC CTGGGGACCC GCCCAAACCA 

7051 GAATACGACT TGGAGTTGAT AACATCATGC TCCTCCAATG TGTCAGTCGC 

7101 GCACGATGCA TCTGGCAAAA GGGTGTACTA TCTCACCCGT GACCCCACCA 

7151 CCCCCCTTGC GCGGGCTGCG TGGGAGACAG CTAGACACAC TCCAGTCAAT 

7201 TCCTGGCTAG GCAACATCAT CATGTATGCG CCCACCTTGT GGGCAAGGAT 

7251 GATCCTGATG ACTCATTTCT TCTCCATCCT TCTAGCTCAG GAACAACTTG 

7301 AAAAAGCCCT AGATTGTCAG ATCTACGGGG CCTGTTACTC CATTGAGCCA 

7351 CTTGACCTAC CTCAGATCAT TCAACGACTC CATGGCCTTA GCGCATTTTC 

7401 ACTCCATAGT TACTCTCCAG GTGAGATCAA TAGGGTGGCT TCATGCCTCA 

7451 GGAAACTTGG GGTACCGCCC TTGCGAGTCT GGAGACATCG GGCCAGAAGT 

7501 GTCCGCGCTA GGCTACTGTC CCAGGGGGGG AGGGCTGCCA CTTGTGGCAA 

7551 GTACCTCTTC AACTGGGCAG TAAGGACCAA GCTCAAACTC ACTCCAATCC 

7601 CGGCTGCGTC CCAGTTGGAT TTATCCAGCT GGTTCGTTGC TGGTTACAGC 

7651 GGGGGAGACA TATATCACAG CCTGTCTCGT GCCCGACCCC GCTGGTTCAT 

7701 GTGGTGCCTA CTCCTACTTT CTGTAGGGGT AGGCATCTAT CTACTCCCCA 
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7751 ACCGATGAAC GGGGAGCTAA ACACTCCAGG CCAATAGGCC ATCCTGTTTT 

7801 TTTCCCTTTT TTTTTTTCTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 

7851 TTCTCCTTTT TTTTTCCTCT TTTTTTCCTT TTCTTTCCTT TGGTGGCTCC 

7901 ATCTTAGCCC TAGTCACGGC TAGCTGTGAA AGGTCCGTGA GCCGCTTGAC 

7951 TGCAGAGAGT GCTGATACTG GCCTCTCTGC AGATCAAGTA CTTCTAGAGA 

8001 ATTCTAGCTT GGCGTAATCA TGGTCATAGC TGTTTCCTGT GTGAAATTGT 

8051 TATCAGCTCA CAATTCCACA CAACATACGA GCCGGAAGCA TAAAGTGTAA 

8101 AGCCTGGGAT GCCTAATGAG TGAGCTAACT CACATTAGTT GCGTTGCGCT 

8151 CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT CCATTAGTGA 

8201 ATCGTCCAAC GCACGGGGAG AGGCGGTTTG CGTATTGGGC GCACTTCCGC 

8251 TTCCTCGCTC ACTGACTCGC TGCGCTCGTT CGTTCGGCTG CGGCGAGCCG 

8301 TATCAGCTCA CTCAAAGGCG GTAATACGGT TATCCACAGA ATCAGGGGAT 

8351 AACGCAGGAA AGACCATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 

8401 TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 

8451 AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 

8501 CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 

8551 TGTTCCGACC CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG 

8601 GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT CAGTTCGGTG 

8651 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 

8701 CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 

8751 GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 

8801 GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 

8851 CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 

8901 TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA ACAAACCACC 

8951 GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 

9001 AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 

9051 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 

9101 AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 

9151 CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGC TTAATC A 

9201 GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 

9251 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 
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9301 CCCCAGTGCT GCAATGATAC CGCGAGAACC ACGCTCACCC GCACCAGATT 

9351 TATCAGCAAT AAACCAGCCA GCCGGAAGTG CGCTGCGGAG AAGTGGTCCT 

9401 GCAACTTTAT CCGCCTCCAT CCAGTCTATT AGTTGTTGCC GGGAAGCTAG 

9451 AGTAAGTAGT TCGCCAGTCA GCAGTTTGCG TAACGTCGTT GCCATAGCAA 

9501 CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC 

9551 GGCTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA 

9601 AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 

9651 CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 

9701 ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 

9751 ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA 

9801 TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT 

9851 GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 

9901 ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 

9951 TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 

10001 GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 

10051 CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG 

10101 GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC 

10151 ACATTTCCCC GAAAAGTGCC ACCTGACGTC TAAGAAACCA TTATTACCAT 

10201 GACATTAACC TATAAAAATA GGCGTATCAC GAAGCCCTTT CGTCTAGCGC 

10251 GTTTCGGTGA TGACGGTGAA AACCTCTGAC ACTTGCAGCT CCCGCAGACG 

10301 GTCACAGCTT GTCTGTAAGC GGATGCCGGG AGCAGGCAAG CCCGTCAGGG 

10351 CGCGTCAGTG GGTGTTGGCG GGTGTCGGGG CTGGCTTAAC TATGCGGCAT 

10401 CAGAGCAGAT TGTACTGAGA GTACACCAGA TGCGGTGTGA AATACCGCAC 

10451 AGATGCGTAA GGAGAAAATA CCGCATCAGC CTCCATTCGC CATTCAGACT 

10501 CCGCAACTGT TGGGAAGGGC GGTCAGTACG CGCTTCTTCG CTATTACGCC 

10551 AACTGGCGAA AGGGGGATGT GCTGCAAGGC GATTAAGTTG GGTAACGCCA 

10601 GGGTTTTCCC AATCACGACG TTGTAAAACG ACAGCCAATG AATTGAAGCT 

10651 TATTAATTCT AGACTGAAGC TTTTAATACG ACTCACTATA (SEQ. ID. NO.:3) 
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SEQUENCE LISTING 

<110> Istituto Di Ricerche Di Biologia Molecolare P. Angeletti S.P.A. 

<120> HEPATITIS C VIRUS REPLICONS AND REPLICON 
ENHANCED CELLS 

<130> IT0003 PCT 

<150> 60/263,479 
<151> 2001-01-23 

<160> 13 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3010 
<212> PRT 

<213> Con 1 HCV isolate nucleic acid 
<400> 1 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 

35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

50 55 60 

He Pro Lys Ala Arg Gin Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

100 105 HO 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 

130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 

165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr 

180 185 190 

Glu Val Arg Asn Val Ser Gly Val Tyr His Val Thr Asn Asp Cys Ser 

195 200 205 

Asn Ala Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 

210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 

245 250 255 

Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Leu Cys 

260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ala 

275 280 285 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 

290 295 300 

Asn Cys Ser He Tyr Pro Gly His Val Thr Gly His Arg Met Ala Trp 
305 310 315 320 
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Asp Met Met Met Asn Trp Ser Pro Thr Ala Ala Leu Val Val Ser Gin 

325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 

340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 

355 360 365 

Ala Lys Val Leu He Val Met Leu Leu Phe Ala Gly Val Asp Gly Gly 

370 375 380 

Thr Tyr Val Thr Gly Gly Thr Met Ala Lys Asn Thr Leu Gly He Thr 
385 390 395 400 

Ser Leu Phe Ser Pro Gly Ser Ser Gin Lys He Gin Leu Val Asn Thr 

405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 

420 425 430 

Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Val His Lys Phe Asn 

435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ala Ser Cys Ser Pro He Asp Ala 

450 455 460 

Phe Ala Gin Gly Trp Gly Pro lie Thr Tyr Asn Glu Ser His Ser Ser 
465 470 475 480 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie 

485 490 495 

Val Pro Ala Ala Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 

500 505 510 

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Ser 

515 520 525 

Trp Gly Glu Asn Glu Thr Asp Val Leu Leu Leu Asn Asn Thr Arg Pro 

530 535 540 

Pro Gin Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe 
545 550 555 560 

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly He Gly Asn 

565 570 575 

Lys Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 

580 585 590 

Thr Tyr Thr Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu 

595 600 605 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 

610 615 620 

Thr lie Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Leu 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asn Leu Glu Asp 

645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 

660 665 670 

Gin Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 

675 680 685 

Leu lie His Leu His Gin Asn Val Val Asp Val Gin Tyr Leu Tyr Gly 

690 695 700 

lie Gly Ser Ala Val Val Ser Phe Ala lie Lys Trp Glu Tyr Val Leu 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp 

725 730 735 

Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 

740 745 750 

Val Leu Asn Ala Ala Ser Val Ala Gly Ala His Gly lie Leu Ser Phe 

755 760 765 

Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro 

770 775 780 

Gly Ala Ala Tyr Ala Leu Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Pro Arg Ala Tyr Ala Met Asp Arg Glu Met Ala Ala 
805 810 815 
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Ser Cys Gly Gly Ala Val Phe Val Gly Leu lie Leu Leu Thr Leu Ser 

820 825 830 

Pro His Tyr Lys Leu Phe Leu Ala Arg Leu He Trp Trp Leu Gin Tyr 

835 840 845 

Phe He Thr Arg Ala Glu Ala His Leu Gin Val Trp He Pro Pro Leu 

850 855 860 

Asn Val Arg Gly Gly Arg Asp Ala Val He Leu Leu Thr Cys Ala He 
865 870 875 880 

His Pro Glu Leu He Phe Thr He Thr Lys He Leu Leu Ala He Leu 

885 890 895 

Gly Pro Leu Met Val Leu Gin Ala Gly He Thr Lys Val Pro Tyr Phe 

900 905 910 

Val Arg Ala His Gly Leu He Arg Ala Cys Met Leu Val Arg Lys Val 

915 920 925 

Ala Gly Gly His Tyr Val Gin Met Ala Leu Met Lys Leu Ala Ala Leu 

930 935 940 

Thr Gly Thr Tyr Val Tyr Asp His Leu Thr Pro Leu Arg Asp Trp Ala 
945 950 955 960 

His Ala Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Val Val Phe 

965 970 975 

Ser Asp Met Glu Thr Lys Val He Thr Trp Gly Ala Asp Thr Ala Ala 

980 985 990 

Cys Gly Asp lie lie Leu Gly Leu Pro Val Ser Ala Arg Arg Gly Arg 

995 1000 1005 

Glu He His Leu Gly Pro Ala Asp Ser Leu Glu Gly Gin Gly Trp Arg 

1010 1015 1020 

Leu Leu Ala Pro He Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu 
1025 1030 1035 . 1040 

Gly Cys He He Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu 

1045 1050 1055 

Gly Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr 

1060 1065 1070 

Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys 

1075 1080 1085 

Thr Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met Tyr Thr Asn Val 

1090 1095 1100 

Asp Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu 
1105 1110 1115 1120 

Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His 

1125 1130 1135 

Ala Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu 

1140 1145 1150 

Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro 

1155 1160 1165 

Leu Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val 

1170 1175 1180 

Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser 
1185 1190 1195 1200 

Met Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro 

1205 1210 . 1215 

Pro Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr 

1220 1225 1230 

Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly 

1235 1240 1245 

Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe 

1250 1255 1260 

Gly Ala Tyr Met Ser Lys Ala His Gly He Asp Pro Asn He Arg Thr 
1265 1270 1275 1280 

Gly Val Arg Thr He Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr 

1285 1290 1295 

Gly Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He 
1300 1305 1310 
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lie lie Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr lie Leu Gly 

1315 1320 1325 

lie Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val 

1330 1335 1340 

Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro 
1345 1350 1355 1360 

Asn lie Glu Glu Val Ala Leu Ser Ser Thr Gly Glu lie Pro Phe Tyr 

1365 1370 1375 

Gly Lys Ala lie Pro lie Glu Thr lie Lys Gly Gly Arg His Leu lie 

1380 1385 1390 

Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser 

1395 1400 1405 

Gly Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser 

1410 1415 1420 

Val He Pro Thr Ser Gly Asp Val He Val Val Ala Thr Asp Ala Leu 
1425 1430 1435 1440 

Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr 

1445 1450 1455 

Cys Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He 

1460 1465 1470 

Glu Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg 

1475 1480 1485 

Gly Arg Thr Gly Arg Gly Arg Met Gly He Tyr Arg Phe Val Thr Pro 

1490 1495 1500 

Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys 
1505 1510 1515 1520 

Tyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser 

1525 1530 1535 

Val Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin 

1540 1545 1550 

Asp His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He 

1555 1560 1565 

Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro 

1570 1575 1580 

Tyr Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro 
1585 1590 1595 1600 

Pro Pro Ser Trp Asp Gin Met Trp Lys Cys Leu He Arg Leu Lys Pro 

1605 1610 1615 

Thr Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin 

1620 1625 1630 

Asn Glu Val Thr Thr Thr His Pro He Thr Lys Tyr He Met Ala Cys 

1635 1640 1645 

Met Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly 

1650 1655 1660 

Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val 
1665 1670 1675 1680 

Val He Val Gly Arg He He Leu Ser Gly Lys Pro Ala He He Pro 

1685 1690 1695 

Asp Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala 

1700 1705 1710 

Ser His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe 

1715 1720 1725 

Lys Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu 

1730 1735 1740 

Ala Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe 
1745 1750 1755 1760 

Trp Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala 

1765 1770 1775 

Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala 

1780 1785 1790 

Phe Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu 
1795 1800 1805 
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Phe Asn lie Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser 

1810 1815 1820 

Ala Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly 
1825 1830 1835 1840 

Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly 

1845 1850 1855 

Ala Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu 

1860 1865 1870 

Met Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser 

1875 1880 1885 

Pro Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg 

1890 1895 1900 

His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He 
1905 1910 1915 1920 

Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro 

1925 1930 1935 

Glu Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr 

1940 1945 1950 

He Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys 

1955 1960 1965 

Ser Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He 

1970 1975 1980 

Cys Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu 
1985 1990 1995 2000 

Pro Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys 

2005 2010 2015 

Gly Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly 

2020 2025 2030 

Ala Gin He Thr Gly His Val Lys Asn Gly Ser Met Arg He Val Gly 

2035 2040 2045 

Pro Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala 

2050 2055 2060 

Tyr Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg 
2065 2070 2075 2080 

Ala Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val 

2085 2090 2095 

Gly Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys 

2100 2105 2110 

Pro Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val 

2115 2120 2125 

Arg Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu 

2130 2135 2140 

Val Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu 
2145 2150 2155 2160 

Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr 

2165 2170 2175 

Asp Pro Ser His He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg 

2180 2185 2190 

Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala 

2195 2200 2205 

Pro Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala 

2210 2215 2220 

Asp Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn 
2225 2230 2235 2240 

He Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe 

2245 2250 2255 

Glu Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala 

2260 2265 2270 

Glu He Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp 

2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro 
2290 2295 2300 
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Asp Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys 
2305 2310 2315 2320 

Ala Pro Pro lie Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser 

2325 2330 2335 

Glu Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe 

2340 2345 2350 

Gly Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser 

2355 2360 2365 

Pro Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser 

2370 2375 2380 

Tyr Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
2385 2390 2395 2400 

Ser Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val 

2405 2410 2415 

Val Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro 

2420 2425 2430 

Cys Ala Ala Glu Glu Thr Lys Leu Pro lie Asn Ala Leu Ser Asn Ser 

2435 2440 2445 

Leu Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala 

2450 2455 2460 

Ser Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp 
2465 2470 2475 2480 

Asp His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr 

2485 2490 2495 

Val Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro 

2500 2505 2510 

Pro His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg 

2515 2520 2525 

Asn Leu Ser Ser Lys Ala Val Asn His lie Arg Ser Val Trp Lys Asp 

2530 2535 2540 

Leu Leu Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys 
2545 2550 2555 2560 

Asn Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala 

2565 2570 2575 

Arg Leu He Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met 

2580 2585 2590 

Ala Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser 

2595 2600 2605 

Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val 

2610 2615 2620 

Asn Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr 
2625 2630 2635 2640 

Arg Cys Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu 

2645 2650 ' 2655 

Ser He Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He 

2660 2665 2670 

Arg Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser 

2675 2680 2685 

Lys Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu 

2690 2695 2700 

Thr Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala 
2705 2710 2715 2720 

Ala Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly 

2725 2730 2735 

Asp Asp Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu 

2740 2745 2750 

£la Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro 

2755 2760 2765 

Pro Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser 

2770 2775 2780 

Cys Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val 
2785 2790 2795 2800 
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Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp 

2805 2810 2815 

Glu Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asri lie lie 

2820 2825 2830 

Met Tyr Ala Pro Thr Leu Trp Ala Arg Met lie Leu Met Thr His Phe 

2835 2840 2845 

Phe Ser lie Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys 

2850 2855 2860 

Gin lie Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin 
2865 2870 2875 2880 

He He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr 

2885 2890 2895 

Ser Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly 

2900 2905 2910 

Val Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala 

2915 2920 2925 

Arg Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu 

2930 2935 2940 

Phe Asn Trp Ala Val Arg Thr Lys Leu Lys lieu Thr Pro He Pro Ala 
2945 2950 2955 2960 

Ala Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly 

2965 2970 2975 

Gly Asp He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met 

2980 2985 2990 

Trp Cys Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro 
2995 3000 3005 

Asn Arg 
3010 

<210> 2 
<211> 9605 
<212> DNA 

<213> Con 1 HCV isolate amino acid 
<400> 2 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 

tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 

cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 

gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 

gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 

gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 

ctcaaagaaa aaccaaacgt aacaccaacc gccgcccaca ggacgtcaag ttcccgggcg 420 

gtggtcagat cgtcggtgga gtttacctgt tgccgcgcag gggccccagg ttgggtgtgc 480 

gcgcgactag gaagacttcc gagcggtcgc aacctcgtgg aaggcgacaa cctatcccca 540 

aggctcgcca gcccgagggt agggcctggg ctcagcccgg gtacccctgg cccctctatg 600 

gcaatgaggg cttggggtgg gcaggatggc tcctgtcacc ccgtggctct cggcctagtt 660 

ggggccccac ggacccccgg cgtaggtcgc gcaatttggg taaggtcatc gataccctca 720 

cgtgcggctt cgccgatctc atggggtaca ttccgctcgt cggcgccccc ctagggggcg 780 

ctgccagggc cctggcgcat ggcgtccggg ttctggagga cggcgtgaac tatgcaacag 840 

ggaatctgcc cggttgctcc ttttctatct tccttttggc tttgctgtcc tgtttgacca 900 

tcccagcttc cgcttatgaa gtgcgcaacg tatccggagt gtaccatgtc acgaacgact 960 

. gctccaacgc aagcattgtg tatgaggcag cggacatgat catgcatacc cccgggtgcg 1020 

tgccctgcgt tcgggagaac aactcctccc gctgctgggt agcgctcact cccacgctcg 1080 

cggccaggaa cgctagcgtc cccactacga cgatacgacg ccatgtcgat ttgctcgttg 1140 

gggcggctgc tctctgctcc gctatgtacg tgggagatct ctgcggatct gttttcctcg 1200 

tcgcccagct gttcaccttc tcgcctcgcc ggcacgagac agtacaggac tgcaattgct 1260 

caatatatcc cggccacgtg acaggtcacc gtatggcttg ggatatgatg atgaactggt 1320 

cacctacagc agccctagtg gtatcgcagt tactccggat cccacaagct gtcgtggata 1380 

tggtggcggg ggcccattgg ggagtcctag cgggccttgc ctactattcc atggtgggga 1440 

actgggctaa ggttctgatt gtgatgctac tctttgccgg cgttgacggg ggaacctatg 1500 

tgacaggggg gacgatggcc aaaaacaccc tcgggattac gtccctcttt tcacccgggt 1560 

catcccagaa aatccagctt gtaaacacca acggcagctg gcacatcaac aggactgccc 1620 
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tgaactgcaa tgactccctc aacactgggt tccttgctgc gctgttctac gtgcacaagt 1680 

tcaactcatc tggatgccca gagcgcatgg ccagctgcag ccccatcgac gcgttcgctc 1740 

aggggtgggg gcccatcact tacaatgagt cacacagctc ggaccagagg ccttattgtt 1800 

ggcactacgc accccggccg tgcggtatcg tacccgcggc gcaggtgtgt ggtccagtgt 1860 

actgcttcac cccaagccct gtcgtggtgg ggacgaccga ccggttcggc gtccctacgt 1920 

acagttgggg ggagaatgag acggacgtgc tgcttcttaa caacacgcgg ccgccgcaag 1980 

gcaactggtt tggctgtaca tggatgaata gcactgggtt caccaagacg tgcgggggcc 2040 

ccccgtgtaa catcgggggg atcggcaata aaaccttgac ctgccccacg gactgcttcc 2100 

ggaagcaccc cgaggccact tacaccaagt gtggttcggg gccttggttg acacccagat 2160 

gcttggtcca ctacccatac aggctttggc actacccctg cactgtcaac tttaccatct 2220 

tcaaggttag gatgtacgtg gggggagtgg agcacaggct cgaagccgca tgcaattgga 2280 

ctcgaggaga gcgttgtaac ctggaggaca gggacagatc agagcttagc ccgctgctgc 2340 

tgtctacaac ggagtggcag gtattgccct gttccttcac caccctaccg gctctgtcca 2400 

ctggtttgat ccatctccat cagaacgtcg tggacgtaca atacctgtac ggtatagggt 2460 

cggcggttgt ctcctttgca atcaaatggg agtatgtcct gttgctcttc cttcttctgg 2520 

cggacgcgcg cgtctgtgcc tgcttgtgga tgatgctgct gatagctcaa gctgaggccg 2580 

ccctagagaa cctggtggtc ctcaacgcgg catccgtggc cggggcgcat ggcattctct 2640 

ccttcctcgt gttcttctgt gctgcctggt acatcaaggg caggctggtc cctggggcgg 2700 

catatgccct ctacggcgta tggccgctac tcctgctcct gctggcgtta ccaccacgag 2760 

catacgccat ggaccgggag atggcagcat cgtgcggagg cgcggttttc gtaggtctga 2820 

tactcttgac cttgtcaccg cactataagc tgttcctcgc taggctcata tggtggttac 2880 

aatattttat caccagggcc gaggcacact tgcaagtgtg gatccccccc ctcaacgttc 2940 

gggggggccg cgatgccgtc atcctcctca cgtgcgcgat ccacccagag ctaatcttta 3000 

ccatcaccaa aatcttgctc gccatactcg gtccactcat ggtgctccag gctggtataa 3060 

ccaaagtgcc gtacttcgtg cgcgcacacg ggctcattcg tgcatgcatg ctggtgcgga 3120 

aggttgctgg gggtcattat gtccaaatgg ctctcatgaa gttggccgca ctgacaggta 3180 

cgtacgttta tgaccatctc accccactgc gggactgggc ccacgcgggc ctacgagacc 3240 

ttgcggtggc agttgagccc gtcgtcttct ctgatatgga gaccaaggtt atcacctggg 3300 

gggcagacac cgcggcgtgt ggggacatca tcttgggcct gcccgtctcc gcccgcaggg 3360 

ggagggagat acatctggga ccggcagaca gccttgaagg gcaggggtgg cgactcctcg 3420 

cgcctattac ggcctactcc caacagacgc gaggcctact tggctgcatc atcactagcc 3480 

tcacaggccg ggacaggaac caggtcgagg gggaggtcca agtggtctcc accgcaacac 3540 

aatctttcct ggcgacctgc gtcaatggcg tgtgttggac tgtctatcat ggtgccggct 3600 

caaagaccct tgccggccca aagggcccaa tcacccaaat gtacaccaat gtggaccagg 3660 

acctcgtcgg ctggcaagcg ccccccgggg cgcgttcctt gacaccatgc acctgcggca 3720 

gctcggacct ttacttggtc acgaggcatg ccgatgtcat tccggtgcgc cggcggggcg 3780 

acagcagggg gagcctactc tcccccaggc ccgtctccta cttgaagggc tcttcgggcg 3840 

gtccactgct ctgcccctcg gggcacgctg tgggcatctt tcgggctgcc gtgtgcaccc 3900 

gaggggttgc gaaggcggtg gactttgtac ccgtcgagtc tatggaaacc actatgcggt 3960 

ccccggtctt cacggacaac tcgtcccctc cggccgtacc gcagacattc caggtggccc 4020 

atctacacgc ccctactggt agcggcaaga gcactaaggt gccggctgcg tatgcagccc 4080 

aagggtataa ggtgcttgtc ctgaacccgt ccgtcgccgc caccctaggt ttcggggcgt 4140 

atatgtctaa ggcacatggt atcgacccta acatcagaac cggggtaagg accatcacca 4200 

cgggtgcccc catcacgtac tccacctatg gcaagtttct tgccgacggt ggttgctctg 4260 

ggggcgccta tgacatcata atatgtgatg agtgccactc aactgactcg accactatcc 4320 

tgggcatcgg cacagtcctg gaccaagcgg agacggctgg agcgcgactc gtcgtgctcg 4380 

ccaccgctac gcctccggga tcggtcaccg tgccacatcc aaacatcgag gaggtggctc 4440 

tgtccagcac tggagaaatc cccttttatg gcaaagccat ccccatcgag accatcaagg 4500 

gggggaggca cctcattttc tgccattcca agaagaaatg tgatgagctc gccgcgaagc 4560 

tgtccggcct cggactcaat gctgtagcat attaccgggg ccttgatgta tccgtcatac 4620 

caactagcgg agacgtcatt gtcgtagcaa cggacgctct aatgacgggc tttaccggcg 4680 

atttcgactc agtgatcgac tgcaatacat gtgtcaccca gacagtcgac ttcagcctgg 4740 

acccgacctt caccattgag acgacgaccg tgccacaaga cgcggtgtca cgctcgcagc 4800 

ggcgaggcag gactggtagg ggcaggatgg gcatttacag gtttgtgact ccaggagaac 4860 

ggccctcggg catgttcgat tcctcggttc tgtgcgagtg ctatgacgcg ggctgtgctt 4920 

ggtacgagct cacgcccgcc gagacctcag ttaggttgcg ggcttaccta aacacaccag 4980 

ggttgcccgt ctgccaggac catctggagt tctgggagag cgtctttaca ggcctcaccc 5040 

acatagacgc ccatttcttg tcccagacta agcaggcagg agacaacttc ccctacctgg 5100 

tagcatacca ggctacggtg tgcgccaggg ctcaggctcc acctccatcg tgggaccaaa 5160 

tgtggaagtg tctcatacgg ctaaagccta cgctgcacgg gccaacgccc ctgctgtata 5220 

ggctgggagc cgttcaaaac gaggttacta ccacacaccc cataaccaaa tacatcatgg 5280 

catgcatgtc ggctgacctg gaggtcgtca cgagcacctg ggtgctggta ggcggagtcc 5340 
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tagcagctct ggccgcgtat tgcctgacaa caggcagcgt ggtcattgtg ggcaggatca 5400 

tcttgtccgg aaagccggcc atcattcccg acagggaagt cctttaccgg gagttcgatg 5460 

agatggaaga gtgcgcctca cacctccctt acatcgaaca gggaatgcag ctcgccgaac 5520 

aattcaaaca gaaggcaatc gggttgctgc aaacagccac caagcaagcg gaggctgctg 5580 

ctcccgtggt ggaatccaag tggcggaccc tcgaagcctt ctgggcgaag catatgtgga 5640 

atttcatcag cgggatacaa tatttagcag gcttgtccac tctgcctggc aaccccgcga 5700 

tagcatcact gatggcattc acagcctcta tcaccagccc gctcaccacc caacataccc 5760 

tcctgtttaa catcctgggg ggatgggtgg ccgcccaact tgctcctccc agcgctgctt 5820 

ctgctttcgt aggcgccggc atcgctggag cggctgttgg cagcataggc cttgggaagg 5880 

tgcttgtgga tattttggca ggttatggag caggggtggc aggcgcgctc gtggccttta 5940 

aggtcatgag cggcgagatg ccctccaccg aggacctggt taacctactc cctgctatcc 6000 

tctcccctgg cgccctagtc gtcggggtcg tgtgcgcagc gatactgcgt cggcacgtgg 6060 

gcccagggga gggggctgtg cagtggatga accggctgat agcgttcgct tcgcggggta 6120 

accacgtctc ccccacgcac tatgtgcctg agagcgacgd tgcagcacgt gtcactcaga 6180 

tcctctctag tcttaccatc actcagctgc tgaagaggct tcaccagtgg atcaacgagg 6240 

actgctccac gccatgctcc ggctcgtggc taagagatgt ttgggattgg atatgcacgg 6300 

tgttgactga tttcaagacc tggctccagt ccaagctcct gccgcgattg ccgggagtcc 6360 

ccttcttctc atgtcaacgt gggtacaagg gagtctggcg gggcgacggc atcatgcaaa 6420 

ccacctgccc atgtggagca cagatcaccg gacatgtgaa aaacggttcc atgaggatcg 6480 

tggggcctag gacctgtagt aacacgtggc atggaacatt ccccattaac gcgtacacca 6540 

cgggcccctg cacgccctcc ccggcgccaa attattctag ggcgctgtgg cgggtggctg 6600 

ctgaggagta cgtggaggtt acgcgggtgg gggatttcca ctacgtgacg ggcatgacca 6660 

ctgacaacgt aaagtgcccg tgtcaggttc cggcccccga attcttcaca gaagtggatg 6720 

gggtgcggtt gcacaggtac gctccagcgt gcaaacccct cctacgggag gaggtcacat 6780 

tcctggtcgg gctcaatcaa tacctggttg ggtcacagct cccatgcgag cccgaaccgg 6840 

acgtagcagt gctcacttcc atgctcaccg acccctccca cattacggcg gagacggcta 6900 

agcgtaggct ggccagggga tctcccccct ccttggccag ctcatcagct agccagctgt 6960 

ctgcgccttc cttgaaggca acatgcacta cccgtcatga ctccccggac gctgacctca 7020 

tcgaggccaa cctcctgtgg cggcaggaga tgggcgggaa catcacccgc gtggagtcag 7080 

aaaataaggt agtaattttg gactctttcg agccgctcca agcggaggag gatgagaggg 7140 

aagtatccgt tccggcggag atcctgcgga ggtccaggaa attccctcga gcgatgccca 7200 

tatgggcacg cccggattac aaccctccac tgttagagtc ctggaaggac ccggactacg 7260 

tccctccagt ggtacacggg tgtccattgc cgcctgccaa ggcccctccg ataccacctc 7320 

cacggaggaa gaggacggtt gtcctgtcag aatctaccgt gtcttctgcc ttggcggagc 7380 

tcgccacaaa gaccttcggc agctccgaat cgtcggccgt cgacagcggc acggcaacgg 7440 

cctctcctga ccagccctcc gacgacggcg acgcgggatc cgacgttgag tcgtactcct 7500 

ccatgccccc ccttgagggg gagccggggg atcccgatct cagcgacggg tcttggtcta 7560 

ccgtaagcga ggaggctagt gaggacgtcg tctgctgctc gatgtcctac acatggacag 7620 

gcgccctgat cacgccatgc gctgcggagg aaaccaagct gcccatcaat gcactgagca 7680 

actctttgct ccgtcaccac aacttggtct atgctacaac atctcgcagc gcaagcctgc 7740 

ggcagaagaa ggtcaccttt gacagactgc aggtcctgga cgaccactac cgggacgtgc 7800 

tcaaggagat gaaggcgaag gcgtccacag ttaaggctaa acttctatcc gtggaggaag 7860 

cctgtaagct gacgccccca cattcggcca gatctaaatt tggctatggg gcaaaggacg 7920 

tccggaacct atccagcaag gccgttaacc acatccgctc cgtgtggaag gacttgctgg 7980 

aagacactga gacaccaatt gacaccacca tcatggcaaa aaatgaggtt ttctgcgtcc 8040 

aaccagagaa ggggggccgc aagccagctc gccttatcgt attcccagat ttgggggttc 8100 

gtgtgtgcga gaaaatggcc ctttacgatg tggtctccac cctccctcag gccgtgatgg 8160 

gctcttcata cggattccaa tactctcctg gacagcgggt cgagttcctg gtgaatgcct 8220 

ggaaagcgaa gaaatgccct atgggcttcg catatgacac ccgctgtttt gactcaacgg 8280 

tcactgagaa tgacatccgt gttgaggagt caatctacca atgttgtgac ttggcccccg 8340 

aagccagaca ggccataagg tcgctcacag agcggcttta catcgggggc cccctgacta 8400 

attctaaagg gcagaactgc ggctatcgcc ggtgccgcgc gagcggtgta ctgacgacca 8460 

gctgcggtaa taccctcaca tgttacttga aggccgctgc ggcctgtcga gctgcgaagc 8520 

tccaggactg cacgatgctc gtatgcggag acgaccttgt cgttatctgt gaaagcgcgg 8580 

ggacccaaga ggacgaggcg agcctacggg ccttcacgga ggctatgact agatactctg 8640 

ccccccctgg ggacccgccc aaaccagaat acgacttgga gttgataaca tcatgctcct 8700 

ccaatgtgtc agtcgcgcac gatgcatctg gcaaaagggt gtactatctc acccgtgacc 8760 

ccaccacccc ccttgcgcgg gctgcgtggg agacagctag acacactcca gtcaattcct 8820 

ggctaggcaa catcatcatg tatgcgccca ccttgtgggc aaggatgatc ctgatgactc 8880 

atttcttctc catccttcta gctcaggaac aacttgaaaa agccctagat tgtcagatct 8940 

acggggcctg ttactccatt gagccacttg acctacctca gatcattcaa cgactccatg 9000 

gccttagcgc attttcactc catagttact ctccaggtga gatcaatagg gtggcttcat 9060 
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gcctcaggaa acttggggta ccgcccttgc gagtctggag acatcgggcc agaagtgtcc 9120 

gcgctaggct actgtcccag ggggggaggg ctgccacttg tggcaagtac ctcttcaact 9180 

gggcagtaag gaccaagctc aaactcactc caatcccggc tgcgtcccag ttggatttat 9240 

ccagctggtt cgttgctggt tacagcgggg gagacatata tcacagcctg tctcgtgccc 9300 

gaccccgctg gttcatgtgg tgcctactcc tactttctgt aggggtaggc atctatctac 9360 

tccccaaccg atgaacgggg agctaaacac tccaggccaa taggccatcc tgtttttttc 9420 

cctttttttt tttctttttt tttttttttt tttttttttt ttttttttct cctttttttt 9480 

tcctcttttt ttccttttct ttcctttggt ggctccatct tagccctagt cacggctagc 9540 

tgtgaaaggt ccgtgagccg cttgactgca gagagtgctg atactggcct ctctgcagat 9600 

caagt 9605 



<210> 3 
<211> 10690 
<212> DNA 

<213> pHCVNeo.17 coding 
<400> 3 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 

tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 

cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 

gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 

gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 

gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 

ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 

cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 

ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 

acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 

cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 

tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 

aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 

cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 

ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 

ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 

gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 

tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 

ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 

agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 

gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 

cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 

ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 

aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 

gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500- 

aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 

gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 

tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 

atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 

aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 

atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 

agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 

acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 

ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 

caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 

ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 

ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 

ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 

acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 

cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 

gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 

gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 

gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
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accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 

tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 

atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 

ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 

gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 

aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 

aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 

ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 

ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 

ctggacccga- ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 

cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 

gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 

gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 

ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 

acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 

ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 

caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 

tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 

atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 

gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 

atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 

gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 

gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 

gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 

tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 

gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 

accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 

gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 

aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 

tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 

atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 

gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 

ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 

cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 

gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 

acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 

gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 

caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 

atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 

accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 

gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 

accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 

gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 

acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 

ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 

gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 

ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 

ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 

tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 

agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 

cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 

tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 

cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 

gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 

acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 

tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 

tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 

acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 

agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 

ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 

gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 

gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
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gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 

ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 

gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 

gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 

atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 

gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 

acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 

cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 

actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 

accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 

aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 

gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 

tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 

tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 

gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 

tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 

actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 

atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 

catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 

tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 

gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 

aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 

ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 

gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 

ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800 

tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860 

tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920 

tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7980 

agatcaagta cttctagaga attctagctt ggcgtaatca tggtcatagc tgtttcctgt 8040 

gtgaaattgt tatcagctca caattccaca caacatacga gccggaagca taaagtgtaa 8100 

agcctgggat gcctaatgag tgagctaact cacattagtt gcgttgcgct cactgcccgc 8160 

tttccagtcg ggaaacctgt cgtgccagct ccattagtga atcgtccaac gcacggggag 8220 

aggcggtttg cgtattgggc gcacttccgc ttcctcgctc actgactcgc tgcgctcgtt 8280 

cgttcggctg cggcgagccg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 8340 

atcaggggat aacgcaggaa agaccatgtg agcaaaaggc cagcaaaagg ccaggaaccg 8400 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 8460 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 8520 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 8580 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 8640 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 8700 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 8760 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 8820 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 8880 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 8940 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 9000 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 9060 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 9120 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 9180 

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 9240 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 9300 

ccccagtgct gcaatgatac cgcgagaacc acgctcaccc gcaccagatt tatcagcaat 9360 

aaaccagcca gccggaagtg cgctgcggag aagtggtcct gcaactttat ccgcctccat 9420 

ccagtctatt agttgttgcc gggaagctag agtaagtagt tcgccagtca gcagtttgcg 9480 

taacgtcgtt gccatagcaa caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 9540 

attcagctcc ggctcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 9600 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 9660 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 9720 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 9780 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 9840 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 9900 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 9960 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 10020 
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gacacggaaa 
gggttattgt 
ggttccgcgc 
gacattaacc 
tgacggtgaa 
ggatgccggg 
ctggcttaac 
aataccgcac 
ccgcaactgt 
agggggatgt 
ttgtaaaacg 
actcactata 



tgttgaatac 
ctcatgagcg 
acatttcccc 
tataaaaata 
aacctctgac 
agcaggcaag 
tatgcggcat 
agatgcgtaa 
tgggaagggc 
gctgcaaggc 
acagccaatg 



tcatactctt 
gatacatatt 
gaaaagtgcc 
ggcgtatcac 
acttgcagct 
cccgtcaggg 
cagagcagat 
ggagaaaata 
ggtcagtacg 
gattaagttg 
aattgaagct 



cctttttcaa 
tgaatgtatt 
acctgacgtc 
gaagcccttt 
cccgcagacg 
cgcgtcagtg 
tg tact gaga 
ccgcatcagc 
cgcttcttcg 
ggtaacgcca 
tattaattct 



tattattgaa 
tagaaaaata 
taagaaacca 
cgtctagcgc 
gtcacagctt 
ggtgttggcg 
gtacaccaga 
ctccattcgc 
ctattacgcc 
gggttttccc 
agactgaagc 



gcatttatca 
aacaaatagg 
ttattaccat 
gtttcggtga 
gtctgtaagc 
ggtgtcgggg 
tgcggtgtga 
cattcagact 
aactggcgaa 
aatcacgacg 
ttttaatacg 



10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10690 



<210> 4 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 4 

acatgatctg cagagaggcc agt 23 

<210> 5 
<211> 26 
<212> DNA 

<213> Primer oligonucleotide 
<400> 5 

gacasgctgt gatawatgtc tccccc 26 

<210> 6 
<211> 21 
<212> DNA 

<213> Primer oligonucleotide 
<400> 6 

tggctctcct caagcgtatt c 21 

<210> 7 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 7 

actctctgca gtcaagcggc tea 23 

<210> 8 
<211> 21 
<212> DNA 

<213> Primer oligonucleotide 



<400> 8 

cagtggatga aceggctgat a 21 

<210> 9 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 9 

ggggegaegg catcatgeaa acc 23 
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<210> 10 
<211> 23 
<212> DNA 

<213> Primer oligonucleotide 
<400> 10 

caggacctgc agtctgtcaa agg 23 

<210> 11 
<211> 17 
<212> DNA 

<213> Primer oligonucleotide 
<400> 11 

cgggagagcc atagtgg 17 

<210> 12 
<211> 19 
<212> DNA 

<213> Primer oligonucleotide 
<400> 12 

agtaccacaa ggcctttcg 19 



<210> 13 
<211> 21 
<212> DNA 
<213> Probe 



<400> 13 

ctgcggaacc ggtgagtaca c 



21 



